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1 Introduction 


In recent years the study of linguistic diversity took center stage in linguistic typology 
(e.g., Evans & Levinson 2009). Nettle (1999: 10) usefully differentiated between three 
types of linguistic diversity that he called language diversity (the number of languages), 
phylogenetic diversity (the number of language families), and structural diversity (gram- 
matical differences among languages). This study is concerned with all three kinds of 
diversity, but places an emphasis on the last. In this it follows Nichols (1992: 2), who 
postulated that "the main object of description here is not principles constraining pos- 
sible human languages but principles governing the distribution of structural features 
among the world’s languages.” Different from a classical and purely synchronic typolog- 
ical study based on a well-balanced global sample of languages, this study openly seeks 
the areal and genetic bias and investigates the distribution of linguistic and especially 
of structural diversity in Northeast Asia (NEA). Because "typological distributions are 
historically grown" (Bickel 2007: 239), this study emphasizes the internal development 
in individual language families as well as their mutual relations. 


The ultimate goal is to understand ^what's where why?", and this makes it clear 
that the major contributions that typology offers are not confined to Cognitive Sci- 
ence as narrowly understood. The goals of 21st century typology are embedded in a 
much broader anthropological perspective: to help understand how the variants 
of one key social institution are distributed in the world, and what general prin- 
ciples and what incidental events are the historical causes for these distributions. 
(Bickel 2007: 248, my boldface) 


Bickel (2015) today calls this approach distributional typology. Nichols (1992), based on an 
analogy with biology, employed the term population typology instead. Dahl (2001: 1456) 
prefers yet another name, areal typology, defined as "the study of patterns in the areal 
distribution of typologically relevant features of languages" that “is both descriptive and 
explanatory” and “has both a synchronic and a diachronic side" What these approaches 
have in common is not only their focus on the distribution of diversity, but also the desire 
to explain its emergence. 

The holistic approach taken in this study can be tentatively characterized as an ecologi- 
cal typology that is committed to an ecologically plausible understanding of language and 
human beings (Hólzl 2015b: 186). However, in linguistics ecology can be understood in a 
variety of different ways. So-called ecolinguistics, for instance, according to one view "is 
the study of the impact of language on the life-sustaining relationships among humans, 
other organisms and the physical environment” and “is normatively orientated towards 
preserving relationships which sustain life? (Alexander & Stibbe 2014: 105) In another 
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sense, the ecological aspect instead refers to the maintenance of languages and ensuing 
preservation of linguistic diversity (e.g., Mithlhausler 1992). The approach followed here 
is less value-driven (Hólzl 2015b: 173f.); it concentrates instead on the description and 
explanation of linguistic diversity. While it shares this focus with the other approaches 
mentioned above, it emphasizes the importance of ecology for an adequate understand- 
ing of language. The fundamental unit of description is the organism-environment sys- 
tem, or OES for short (e.g., Turvey 2009; Welsch 2012). According to Járvilehto (1998: 
329), the theory of the OES maintains "that in any functional sense organism and envi- 
ronment are inseparable and form only one unitary system. The organism cannot exist 
without the environment and the environment has descriptive properties only if it is 
connected to the organism.’ This theory has a relatively long history, which is concisely 
summarized in Járvilehto (2009). For example, Sumner (1922: 233) employed the term 
organism-environment complex instead, but similarly claimed that "the organism and 
the environment interpenetrate one another through and through? However, Jarvilehto 
(2009) did not mention a very similar concept called the life space advocated by Lewin 
(1936: 12): "Every scientific psychology must take into account whole situations, i.e., the 
state of both person and environment.” Language, it will be argued, is an integral compo- 
nent of the human OES. Language is not restricted to the organism (e.g., the brain), but 
equally has an existence as a self-constructed niche (Odling-Smee & Laland 2009; Sinha 
2013), i.e. a modification of the environment by an organism such as the web of a spider 
or the dam of a beaver (Odling-Smee et al. 2013: 5). 


Niche construction refers to the modification of both biotic and abiotic components 
in environments via trophic interactions and the informed (i.e., based on genetic 
or acquired information) physical ^work" of organisms. It includes the metabolic, 
physiological, and behavioral activities of organisms, as well as their choices. 


Human niche construction encompasses a multitude of different examples, ranging from 
the use of tents such as the Evenki d’u (similar to a tipi), over the domestication of rein- 
deer, the construction of railroads, or deforestation, to human-induced climate change. 
In fact, given the extraordinary impact of humans on the environment, the term Anthro- 
pocene has been suggested as the contemporary geological epoch (e.g., Rosol & Renn 
2017 and references therein). The hypothesis that language is an integral component of 
the organism-environment system has important consequences for the understanding 
of linguistic diversity. Of course, linguistic diversity is neither scattered at random, nor 
is it without limits. Rather, there must be a reason for the distribution of linguistic diver- 
sity we find today (Bickel 2014; Bickel 2015: 904f.). However, a distinction between syn- 
chrony and diachrony is insufficient as a proper explanation. One of the most promising 
approaches to the natural causes of language has recently been put forward by (Enfield 
2014: 13ff.), who distinguishes between a total of six causal frames in which linguistic 
processes occur. 


Each of the six frames — microgenetic, ontogenetic, phylogenetic, enchronic, di- 
achronic, synchronic - is distinct from the others in terms of the kinds of causality 


it implies, and thus in its relevance to what we are asking about language and its 
relation to culture and other aspects of human diversity. One way to think about 
these distinct frames is that they are different sources of evidence for explaining 
the things that we want to understand. (Enfield 2014: 13) 


These causal frames are related to, but not quite identical with, different time scales, 
ranging from milliseconds to millions of years (Table 1.1). There is a certain amount of 
mutual interdependence and influence between these frames, each of which combines 
properties of both organism and environment to different degrees. Niche construction, 
for example, may exist at several time scales and can “accumulate over time” (Odling- 
Smee et al. 2013: 18). 


Table 1.1: Examples of causal frames loosely based on Enfield (2014: 13-17) with 
a focus on language 


Frames Timescales Examples 
phylogenetic ky-my biological evolution, climate change, language 
evolution 


diachronic -k 
TOM language change, language families, 


conventionalization 

ontogenetic m-y individual biography, language acquisition, 
entrenchment 

enchronic s-m 
turn-taking, conversation, question-response 
sequences 

microgenetic | ms-s physiological processes, action, perception, 
conception 

synchronic = language systems, knowledge of a given language 


All of these frames are crucial to an explanation of linguistic diversity, although a 
focus will be on some of them. Originally, linguistic typology was mostly concerned 
with the synchronic dimension, which is a necessary abstraction to consider individual 
languages as fixed entities that can be described and compared. The diachronic frame 
primarily concerns language change over a period of years or thousands of years. This 
study in particular investigates what will be called the grammar of questions (GQ), i.e. 
those aspects of any given language that are specialized for asking questions or regularly 
combine with these.! The ability to ask questions as well as the existence of specialized 
constructions for asking questions seem to be universal. Questions, of course, are part of 
question-response sequences, which are located in the enchronic frame that refers to so- 
cial interaction. Most theoretical discussions of questions, from a speech act perspective 


!Cable's dissertation has the title The grammar of Q (Cable 2007). However, the term itself has not been 
clearly defined and is grounded in generative grammar. 
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for example, concentrate on this frame (e.g., Levinson 2012a). Exceptions include psy- 
chological studies (e.g., Loewenstein 1994) or the so-called cognitive typology approach 
by Schulze (2007), which also include the microgenetic frame. As opposed to the social 
dimension of the enchronic frame, the microgenetic perspective concentrates on the cog- 
nitive and physiological processes that take place within the organism-environment sys- 
tem. The emergence of the grammar of questions over phylogenetic (human and linguistic 
evolution) and ontogenetic time-spans (individual development, especially of children), 
as described by Tomasello (2008), will not play an important role in this study. 

Apart from the causal frames, it is important to add different loci of causes, which can 
be described metaphorically as different types of ecology that language is embedded in. 
A recent classification proposed by Steffensen & Fill (2014: 7) distinguishes between four 
different ecologies: 


(1) Language exists in a symbolic ecology: this approach investigates the co-exis- 
tence of languages or ‘symbol systems’ within a given area. (2) Language exists 
in a natural ecology: this approach investigates how language relates to the bio- 
logical and ecosystemic surroundings (topography, climate, fauna, flora, etc.). (3) 
Language exists in a sociocultural ecology: this approach investigates how lan- 
guage relates to the social and cultural forces that shape the conditions of speakers 
and speech communities. (4) Language exists in a cognitive ecology: this approach 
investigates how language is enabled by the dynamics between biological organ- 
isms and their environment, focusing on those cognitive capacities that give rise 
to organisms’ flexible, adaptive behaviour. (my enumeration and boldface) 


Of course, a focus on language as such is only an abstraction and the above distinction 
merely highlights several important perspectives (Steffensen & Fill 2014: 7). Each of the 
four different ecologies influences all three kinds of linguistic diversity, i.e. language, 
phylogenetic, and structural diversity. 

In many cases the exact influence of the four ecologies is only beginning to be under- 
stood (e.g., De Busser 2015), which is why only a handful of examples connected with 
the grammar of questions can be given here. Symbolic ecology refers to the aspect of lan- 
guage contact that has a central position in areal linguistics. It encompasses phenomena 
such as the borrowing of linguistic items, the creolization of languages, or language shift. 
For example, many languages of China that share a common Chinese ad- or superstrate 
have borrowed the question marker ba IE (see below and §5.9.2.1). Natural ecology, too, 
is an aspect that should not be underestimated (e.g., Axelsen & Manrubia 2014). After 
all, the distribution of languages even today is determined to a large degree by natural 
and constructed affordances—roughly possibilities of action (Lewin 1936; Gibson 1979) 
—of our environment such as those of rivers, mountains, roads, bridges, or borders. Cli- 
mate clearly also influences all three types of linguistic diversity (e.g., Everett et al. 2015; 
2016). For example, languages that mark polar questions with intonation exclusively and 
do not have additional question marking strategies—similar to the total number of lan- 
guages—strangely cluster around the tropics (Dryer 2013j). In Northeast Asia there are 
almost no such languages. The sociocultural ecology plays an important role in language 


spread as well, but also influences the relative prestige and importance of languages. 
This has a direct influence on language shift and the direction of borrowing of linguis- 
tic items in language contact situations. As shown by Trudgill (2011) the social ecology 
can have a strong influence on the complexity of a given language, including aspects 
of the grammar of questions, such as the interrogative system (see §6.3). Furthermore, 
the culture and way of life of a speech community may have an impact on the struc- 
ture of languages. Cysouw & Comrie (2013: 388) argued, for instance, that the languages 
of hunter-gatherers might have preferences for certain linguistic features such as “rela- 
tively many cases of initial interrogatives”, although this could not be confirmed for NEA, 
which contains few real hunter-gatherer groups and few languages with sentence-initial 
interrogatives. The last point mentioned, the cognitive ecology, especially from a micro- 
genetic perspective, is an important factor in the structural properties the grammar of 
questions tends to have cross-linguistically. For example, there is a recurrent structural 
pattern among many different languages in which a content question is immediately fol- 
lowed by a polar, focus, or alternative question (e.g., What are you doing, are you crazy?), 
which can be explained by aspects of the human conceptual system (see $4.4, 86.3). 

In principle, all four perspectives are crucial for a complete investigation of language 
as well as the grammar of questions. Nevertheless, within this study the focus will lie 
on the aspect of language contact (symbolic ecology). Furthermore, a word of caution 
is in order. While most scholars would probably agree that there may be fundamental 
differences among individual symbolic, natural, and sociocultural ecologies, there is of- 
ten a tacit assumption of the uniformity of human cognition throughout the world. This 
is what Levinson (2012b: 397) has rightfully called "the original sin of the cognitive sci- 
ences—the denial of variation and diversity in human cognition? In fact, Henrich et al. 
(2010: 61) have quite convincingly shown that many previous investigations in cognitive 
science or psychology were strongly biased due to problematic samples of participants 
that do not accurately represent human diversity. This presents us with a severe problem. 
For instance, questions, it might be argued, can be seen as a way to verbally resolve cu- 
riosity. Problematically, publications on curiosity such as Reio (2011: 453) usually share 
this tacit assumption of universality: 


Curiosity is the desire for new information and sensory experience that motivates 
exploratory behavior. External stimuli with novel, complex, uncertain, or conflict- 
ing properties (i.e., collative stimuli) create internal states of arousal that motivate 
exploratory behaviors to reduce the state of arousal. 


Curiously, there are surprisingly few scientific investigations of curiosity. That is why 
this study necessarily follows this theory, which is basically a summary of Berlyne (1954; 
1960; 1978). But it should be borne in mind that there are personal differences of curiosity 
in both quantity and quality (e.g., von Stumm et al. 2011). 

The bulk of this study is a bottom-up comparison of the grammars of questions in dif- 
ferent languages and a tentative explanation of their similarities and differences in terms 
of some of the causal frames and ecologies sketched out above. As further explained in 
Chapter 4, the typology of questions proposed in this study will mostly concentrate on 
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question marking and interrogatives (see also Huang et al. 1999). This is a major differ- 
ence from previous approaches that are usually based on a distinction between different 
question types, such as polar and content questions. These two domains—question mark- 
ing and interrogatives—behave quite differently, for instance as regards the symbolic 
ecology and diachronic time scale. Interrogatives are known to be generally very con- 
servative (e.g., Diessel 2003). In many instances, an interrogative can even remain stable 
for thousands of years. For example, English where can be directly traced back over a 
time span of several thousand years to Proto-Indo-European *k” or with the same mean- 
ing (Mallory & Adams 2006: 419f.). Proto-Indo-European was probably spoken about 
6500 years before present (Anthony & Ringe 2015), which means that the interrogative 
is at least of this age. Diessel (2003: 649) thus correctly concludes that interrogatives (and 
demonstratives) “are generally so old that their roots are not etymologically analyzable". 
Theoretically, similar interrogatives can thus be employed to detect previously unknown 
old genetic connections between languages. In NEA there are a few possible examples 
of this sort. The most striking is a personal interrogative ‘who’ that has an uncanny sim- 
ilarity in several families, even if one goes back to the respective proto-languages (e.g., 
Proto-Mongolic "ken, Proto-Turkic “kim ~ "kám, Proto-Yukaghiric "kin etc.). This will 
be called the KIN-interrogative in this study (see 86.2.1). Furthermore, many languages 
in NEA have what will be called K-interrogatives, that is, they have several interroga- 
tives that share a so-called resonance (a submorpheme, see Bickel & Nichols 2007: 209; 
Mackenzie 2009: 1141) that has the form of a velar or uvular plosive or fricative (e.g., 
Nanai xar ‘what’, xado ‘how many’, xooni ‘how’). Given its fuzzy boundary and only 
partly analyzable character, a resonance will be indicated with a tilde (e.g., Nanai x-) in 
order to keep it apart from fully analyzable morpheme boundaries written with a hy- 
phen (e.g., Nanai xar-wa ‘what-acc’). This is similar to well-known submorphemes such 
as English gl-, found in gleam, glimmer, glisten, or glow. Despite the fact that the initial 
consonant cluster is not clearly analyzable, the individual instances nevertheless have a 
vague similarity in meaning. A resonance usually, but not necessarily, indicates a com- 
mon origin of different interrogatives within one language. It may be noted, however, 
that KIN- and K-interrogatives are, first and foremost, typological labels and do not nec- 
essarily indicate a common origin of different languages as was assumed by Greenberg 
(2000: 217-224). They are intended to be analogous to the well-known m-T-pronouns 
found throughout Eurasia, such as in English me and thee or Nanai mi 'T and si 'you 
(sc)' (see Nichols & Peterson 2013). Interrogatives are rarely borrowed, and when they 
are, this usually indicates an extreme contact situation or perhaps widespread bilingual- 
ism. Take Medny;j Aleut, for instance, which may be considered a truly mixed language. It 
exhibits interrogatives both of Aleut (e.g., kiin ^who") and of Russian (e.g., kuda ‘where’) 
origin (see $5.4.3). Bickerton (2016 [1981]: 65f.) and Muysken & Smith (1990) argue that 
creole and pidgin languages may have a preponderance of synchronically analyzable in- 
terrogatives such as English at what time. Because most languages contain at least some 
instances of analyzable interrogatives, it will be argued that, in order to identify such in- 
stances, the whole interrogative system needs to be investigated (Muysken & Smith 1990). 
In most cases of analyzable interrogatives in NEA the actual interrogative takes first po- 


sition (e.g., Manchu ai-ba- ‘what-place-’). Generalizing on Bickerton’s (2016 [1981]) and 
Muysken & Smith’s (1990) assumption, the emergence of several analyzable interrog- 
atives can be said to be an instance of simplification in the sense of a “regularization 
of irregularities”, an “increase in morphological transparency” (Trudgill 2011: 62), and 
a reduction in the number of actual interrogatives. This is most likely due to a specific 
type of strong language contact such as massive non-native language acquisition (e.g., 
McWhorter 2007). In sum, interrogatives may thus indicate different kinds of strong 
language contact (mixing, simplification) and perhaps very distant genetic relationships. 
The overall similarity of interrogative systems among related languages can also func- 
tion as a rough proxy for their time of divergence. 

Question marking behaves very differently from interrogatives. Of course, question 
marking may remain stable over long time spans in some cases, but generally is much 
less stable and more flexible than the interrogative system and is extremely sensitive 
to language contact. In NEA alone there are dozens of examples of borrowed question 
markers. One prominent example is the Chinese marker ba HE that marks polar ques- 
tions with an additional moment of supposition (‘isn’t it the case that’). The marker has 
been borrowed by many languages spoken in China today from diverse language fami- 
lies and in many different regions. Even structural question marking such as verb-first 
word order as found in Germanic languages has been adopted by some Uralic languages, 
for example (Miestamo 2011). Question marking thus has the potential to indicate lan- 
guage contact, and this it does quite independently of the intensity of the contact. Even 
relatively light contact may lead to the adoption of a question marker from other lan- 
guages. However, question marking cannot suggest distant language families. Without 
doubt, this difference between the two domains—question marking and interrogatives 
—is an example of the more general principle "that basic structural features tend to be 
stable, whereas pragmatically sensitive features such as politeness phenomena and evi- 
dentials tend to be unstable? (Trudgill 2011: 3) But interrogatives and question marking 
certainly represent the extreme ends of what may be conceptualized as a continuum. 
More or less, they are in complementary distribution when it comes to genetic inheri- 
tance and different types of areal contacts. However, the type of question marking (e.g. 
initial question marker) appears to be more stable than the actual form of the question 
marker. For instance, many Tungusic languages have a tendency for sentence-final po- 
lar question markers despite the fact that they are etymologically unrelated and attested 
many thousand kilometers apart, e.g. Sibe =na# at the Chinese Kazakh border or Even 
=Ku# in northeastern Siberia. The type of question marking thus seems to take a posi- 
tion between the two extremes. Therefore, the grammar of questions represents an ideal 
tool for the identification of linguistic convergence, possible middle- or long-range re- 
lationships, and instances of unusually extreme language contact. Linguistic diversity, 
just like archaeological records or the human genome, can thus function as a powerful 
source for the investigation of human prehistory over time spans of hundreds and thou- 
sands of years (e.g., Nichols 1992; Heggarty & Renfrew 2014b). In this study Northeast 
Asia functions as a testing ground for this tentative methodology (see 86.3). 
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Northeast Asia (NEA) here is first and foremost defined geographically as the region 
north of the Yellow River and east of the Yenisei (Figure 1.1). A natural boundary is 
formed in the north by the Arctic Ocean and in the east by the Pacific. In the northeast, 
the Bering Strait separates NEA from Alaska. NEA includes all islands along the Pacific 
Rim up to the Aleutian chain that are all located north of Taiwan, but excludes Taiwan 
itself, which has stronger ties with Southeast Asia. The islands in the Arctic Ocean are 
largely uninhabited, which renders them irrelevant for the purposes of this study. The 
Altai, the Kunlun, the Pamir, the Karakorum, the Tianshan, the Qinling, and the Tibetan 
Plateau will be taken as natural boundaries to the west, southwest, and south. 

Thus defined, NEA is a vast area that covers all of Japan, Mongolia, and the two Koreas 
as well as all of the Far Eastern Federal district, most of the Siberian Federal district of 
Russia, and northern China, including Manchuria, Inner Mongolia, Xinjiang, parts of the 
adjacent provinces, and certain parts of Tibet (Amdo). 

Unfortunately, Asia is a clear concept only until one tries to define it properly. It com- 
bines cultures and languages as diverse as Israel and the Asiatic Eskimos, it is located on 
several distinct tectonic plates, the largest of which includes Europe but not India, and 
there is no meaningful boundary of any sort that would clearly differentiate between 
Asia and Europe. Thus, in the end one is left with the two possibilities that Sinor (1990) 
was struggling with when trying to define the cultural area of Inner Asia. He was well 
aware that the term Inner Eurasia would have been more adequate, but today the term 
Asia is simply too strongly conventionalized and entrenched. This book similarly makes 
use of the term Northeast Asia, even though Northeast Eurasia might have been the better 
choice. Nevertheless, this makes it compatible with previous approaches with the same 
name and research on neighboring areas such as Southeast Asia (SEA). 

Apart from Northern China, Korea, and Japan, NEA is extremely sparsely settled. Even 
Northeast China (Manchuria) and northern Japan (Hokkaido) have only been settled in 
larger numbers within the last 150 years or so (e.g., Janhunen 1996). In contrast with 
the Western Siberian Lowland and the adjacent regions of European Russia and Eastern 
Europe, most of NEA may be said to be generally very mountainous or at least to be 
located at higher altitudes. NEA has important bodies of water, including lakes such as 
Lake Baikal, which defines something like the center of NEA, and several large rivers 
that play an important role for the dispersal of languages. In Russia these are, beginning 
from the west, the Yenisei, the Lena, the Indigirka, and the Kolyma, all of which flow 
into the Arctic Ocean. Further south, the Amur forms the border between Russia and 
China before it bends towards the northeast and flows into the Sea of Okhotsk. In China, 
the Liao flows into the Gulf of Bohai from the north and the Yellow River from the west. 
There are several smaller rivers such as the Yalu, which forms the border between North 
Korea and China, or the Anadyr in Chukotka. For the most part, NEA is characterized by 
a continental climate with cold and often dry winters but warm or hot and more humid 
summers. However, there are considerable regional differences ranging from a tundra 
climate in the northern parts of Russia, to a very humid subtropical climate in the south 
of Japan, to a desert climate in northwestern China as well as parts of Mongolia. The 
northern parts of NEA are mostly covered by Taiga and, further north, by tundra. As 
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Figure 1.1: Some natural boundaries of Northeast Asia; adapted from https://en. 
wikipedia.org/wiki/Geography_of_Asia, adapted from http://visibleearth.nasa. 
gov/view_rec.php?vevlid=11656 (Accessed 2016-04-10.) 
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one moves south, the Taiga changes into mixed forests that give way to the steppes in 
Inner and Outer Mongolia, the Manchurian and North Chinese Plain, the Ordos Plateau, 
as well as the deserts Gobi and Taklamakan (e.g., Taaffe 1990; Janhunen 1996; Narangoa 
& Cribbs 2014). 

Parts of NEA have been home to Homo erectus, Neanderthals, Denisovans, and possi- 
bly to other human (sub)species, the classification of which is still disputed. Despite the 
possibility that both Neanderthals and Denisovans may have had a language comparable 
to languages today (e.g., Dediu & Levinson 2013) and the fact that both interbred with 
modern humans (Sankararaman et al. 2016; Reich 2018 and references therein), there is 
no direct evidence for the languages these extinct groups may have spoken. For this 
reason, only the language of anatomically modern humans (AMH) can be investigated 
here. AMH reached NEA and even the northernmost parts of it at least 45 kya (Pitulko 
et al. 2016, see also Lbova 2014). However, the earliest records of any language in NEA 
are from Old Chinese and are only about 3250 years old and thus much younger than 
Sumerian (about 5000 years old) or Ancient Egyptian (about 4700 years old). If history 
is defined as that period when written language was present, in large parts of NEA it 
only started several centuries ago (Bellwood 2013). Linguistic reconstructions of some 
of the oldest proto-languages located in or close to NEA, such as of Austronesian, Trans- 
Himalayan (Sino-Tibetan), Uralic, and maybe Dene-Yeniseian, must be several thousand 
years older than Old Chinese records, but nothing comparable to the time of the first 
peopling of the area. 

The earliest accounts of Northeast Asia such as Nicolaas Witsen’s (1705) Noord en Oost 
Tartarye employed the term Tatary (or Tartary), but were quite inconsistent in their 
use of it. This name has dropped out of use today and in English there is at present no 
common designation for what has been defined as NEA above. Only in recent years has 
there been an increase in the West of publications bearing the name Northeast Asia in 
the title. Interestingly, this is a much more common concept in Japan (hokuto ajia JOR 
PYT), Korea (dongbuk asia), Mongolia (züün xojd azi), and China (dongbéi yàzhou FAL 
YEW), but apparently less so in Russia (severo-vostochnaja azija). The origin of the term 
has recently been concisely summarized by Narangoa & Cribbs (2014: 2): 


The term “Northeast Asia” is relatively new. It was introduced into academic dis- 
course in the 1930s by the American historian and political scientist Robert Kerner, 
who taught at the University of California. Kerner’s “Northeast Asia” comprised 
the Korean Peninsula, the Manchurian Plain, the Mongolian Plateau, and the moun- 
tainous regions of Eastern Siberia, stretching from Lake Baikal to the Pacific Ocean. 


In her recent book Early modern China and Northeast Asia, Rawski (2015) included more 
or less the same region. My account adds substantial areas to this definition, especially 
in the north and the west. Nevertheless, my approach is similar to Narangoa & Cribbs’s 
(2014: 2) and Rawski’s (2015) in trying to break down traditional conceptions of East Asia 
and a Sinocentric view. Interestingly, an older definition by Chard (1974: xv), which only 
came to my attention after the bulk of this study was already written, roughly coincides 
with my definition above: 
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The area covered comprises Siberia from the Altai Mountains and Yenisei River 
valleys eastwards, Mongolia, Manchuria, Korea, and Japan. This area has a certain 
coherence. Geographically, if we except western Siberia with its close affinities to 
European Russia, it represents the steppe, forest, and tundra zones of northern Asia, 
lying beyond the loess farmland of traditional China. 


The only difference concerns the exclusion of Xinjiang and other parts of northern China. 
Xinjiang happens to be included in NEA in this study because of its relatively old ties to 
central China due to Chinese expansions and trade along the Silk Roads, the presence 
of a great many northwestern Mandarin speakers today, and some linguistic connec- 
tions to Amdo and Mongolia. Xinjiang is also included in Nichols’s (1992: 25f.) concept 
of Northern Asia, which coincides with my definition, except that it includes those areas 
between the Yenisei and the Ural Mountains. In his recent book The peoples of Northeast 
Asia through time, Zgusta (2015: 21ff.) is not very clear about his definition of Northeast 
Asia, but he puts an emphasis on what he calls Pacific Northeast Asia, which only in- 
cludes northern Japan, Sakhalin, eastern Manchuria, Kamchatka, and Chukotka. Here 
this quite useful term will be adopted to additionally include all of Japan, Korea, and 
the area around the Gulf of Bohai, i.e. all of insular and peninsular NEA adjacent to the 
Pacific. 

The brief review above is not exhaustive but sufficiently illustrates a wide variety of 
overlapping designations and definitions of NEA. One of the few authors who draw a 
more differentiated picture is Janhunen (2010: 284): 


In the widest sense, Northeast Asia as a geographical and ethnohistorical region 
can be defined as the entire northeastern part of the Eurasian continent, delimited 
by the Yenisei in the west and the Yellow River in the south. In the northeast, the 
region extends, in principle, to the Bering Strait. In a somewhat narrower frame- 
work, Northeast Asia may be defined as comprising the territory between the Amur 
and Yellow River basins, including the Korean Peninsula and the Japanese Islands 
in the Pacific coastal zone, but excluding the northeasternmost limits of what is 
today the Russian Far East. (my boldface) 


This broad definition has clearly been influenced by Chard's point of view (Janhunen 
1996: 7). The narrow definition, on the other hand, is more or less identical with the 
perspective taken by Narangoa & Cribbs (2014) or Rawski (2015) seen above and may 
be more appropriately termed Greater Manchuria instead of Northeast Asia (Janhunen 
1996: 6). Needless to say, this study is based on a wide definition of NEA. 

The addition of the part and beyond to the title of this book has two meanings. First, 
some languages such as the Turkic languages Chuvash and Turkish that are located 
outside of, but have ties to, or in these cases even originate in, NEA, will be included 
as well. This problem of establishing a meaningful western boundary of, in their terms, 
northern East Asia has also been observed by Heggarty & Renfrew (2014a: 873): 


Turkish serves also to stress just how far the typological unity of this language area 
stretches beyond any geographical definition of East Asia. For in linguistic terms 
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- whether in family affiliations, typology or prehistory - northern Asia allows of 
no meaningful division into eastern or western parts. This language area covers its 
entirety, westwards to the Urals and, as Turkish (or Finnish) attest, in parts beyond. 
Its origin and core, however, do lie firmly within our scope here. 


Second, despite its focus on one area, this study is still intended to be applicable to other 
languages. Especially Chapter 4 is a more classical approach to typology that seeks to 
understand what grammars of questions are cross-linguistically attested and possible (cf. 
Hólzl 2016b). Therefore, it makes extensive use of data from languages outside of NEA. 
The survey of languages in Northeast Asia is intended to be as exhaustive as possi- 
ble. As Voegelin & Voegelin (1964: 2) put it: “In linguistic ecology, one begins not with 
a particular language but with a particular area, not with selective attention to a few 
languages, but with comprehensive attention to all the languages in the area? (my 
boldface) However, some individual languages are underrepresented because of a lack 
of data. The accuracy and amount of details of descriptions for languages and families 
varies considerably with my personal experience and the available literature. This book 
largely relies on previously published material, but several speakers and experts of indi- 
vidual languages were consulted as well. German examples are based on my knowledge 
as a native speaker. Given my educational background, literature in Chinese, English, 
and German form the linguistic core on which this book is based. There are a few French 
publications on NEA languages that were included as well. Russian and especially Jap- 
anese literature was consulted where possible, but not with equal intensity. Therefore, 
the southern part of NEA is somewhat overrepresented in this study. Finnish, Hungar- 
ian, Korean, and Mongolian publications were necessarily excluded. Other languages 
play no significant role for the study of the languages of Northeast Asia. Unfortunately, 
most grammatical descriptions are insufficient and only those in English and Japanese 
usually reach an international standard with adequate analyses of examples and gloss- 
ing. For a typological study, Chinese descriptions that have a rudimentary glossing with 
characters but usually lack a clear analysis, are usually more useful than German or Rus- 
sian publications that, with some exceptions, usually lack glosses or analyses completely. 
As a consequence, many of the examples found in this study have been painstakingly 
analyzed by myself as far as possible, by and large following the Leipzig Glossing Rules.” 
Remaining uncertainties are signaled with a question mark. For most of the languages 
in NEA only rather brief accounts are available. These are often limited to mentioning a 
handful of unexplained interrogatives with very rough translations and, with some luck, 
unanalyzed examples of polar and content questions. The length of the descriptions of 
the languages within this study also varies due to extreme differences in the complexity 
of the grammar of questions. It is not always easy to distinguish between simplicity and 
a lack of information. But there certainly are extremely complex systems such as in the 
Yupik languages that require several pages and tables just to give a rough outline. Some 
of the most complex systems can be found in Omotic languages (Afroasiatic) spoken 
in Ethiopia (see Amha 2012; Kóhler 2013; 2016, and references therein). In comparison 


?See https://www.eva.mpg.de/lingua/resources/glossing-rules.php (Accessed 2016-07-06.) 
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(i.e., relative complexity), most languages of NEA have much simpler and typologically 
more common grammars of questions (e.g., Miestamo 2008). Given the large number 
of languages included in this study, the description of individual languages is necessar- 
ily somewhat superficial and experts will certainly have a lot more to say about each 
of them. For several reasons, $5.10 on Tungusic is somewhat more extensive than those 
on other language families. First, my personal knowledge of Tungusic is better than for 
many other languages in this study. Second, there are extremely good descriptions of 
questions in some Tungusic languages such as Evenki and Udihe. Third, because of their 
vast distribution over almost all of NEA, Tungusic could potentially be crucial for this 
study (see Chapter 3). This study also includes several varieties that were described only 
from the 1980s onward by Chinese scholars but seem to have mostly gone unnoticed out- 
side of China. Tungusic languages will also sometimes be considered in other chapters 
to illustrate certain points. 

There have been several earlier studies on questions in the languages of NEA. There 
are many good descriptions of questions in individual languages such as Zhang Dingjing 
(1991) on Kazakh, M. Hayashi (20102) on Japanese or Yoon (2010) on Korean, to name but 
a few examples. There are far fewer studies of questions in more than one language, 
but still no exhaustive list can be given here. Audova (1997) briefly investigates ques- 
tion marking types in the northern part of NEA, but lacks a clear analysis and confuses 
interrogative verbs (a subtype of interrogatives) with question marking. Nevertheless, 
she makes some useful observations on possible areal connections. Luo Tianhua's (2013) 
dissertation is an investigation of questions in the languages of China and thus covers 
the southern half of NEA. Unfortunately, the overview of most languages is superficial 
and not always reliable. For instance, only two and a half pages are devoted to all the 
Tungusic languages spoken in China (Luo Tianhua 2013: 133-135). Several names of indi- 
vidual languages are erroneous and Korean is wrongly classified as a Tungusic language. 
Nevertheless, there are useful insights about questions in Mandarin and some other lan- 
guages. More problematic is Greenberg's (2000: 217-234) investigation of interrogatives 
in so-called Eurasiatic languages, which compares look-alike elements in a more or less 
random sample of languages and claims to have proven a genetic connection among 
them. A high-quality description of polar question marking in Uralic languages, on the 
other hand, some of which are spoken in NEA, is given by Miestamo (2011), which is also 
the most up-to-date description of polar question marking types. Yet another very good 
typology of questions in Austronesian languages of Taiwan, mostly excluded from this 
study, can be found in Huang et al. (1999). 

In sum, at its core this study is an investigation of the distribution of structural diver- 
sity in the grammar of questions in the limited geographical region of Northeast Asia 
and beyond. The restriction to one category is necessary for reasons of space and clarity, 
and the process of zooming in on one region allows a higher resolution and historical ac- 
curacy than is usually the case in linguistic typology. Some of the questions addressed by 
this study are: ^What does it mean to question?" (Sanitt 2011: 561) Are questions indeed 
universal, and if yes, why? What about questions is variable? How can this variation be 
classified? What are possible motivations behind this variation? What patterns do the 
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languages of Northeast Asia show with respect to this classification? What roles do ge- 
ography, genetic inheritance, and language contact play in explaining these patterns? Is 
there convergent evidence from other disciplines such as genetics? And finally, does the 
concept of Northeast Asia make sense from the point of view of areal linguistics? 

This book is organized into seven chapters, including this Introduction. Chapters 2 and 
3 briefly present the languages of NEA from a genetic and an areal perspective, respec- 
tively. Chapter 4 introduces a somewhat new typology of questions that is illustrated 
with languages from around the world. The longest chapter (Chapter 5) gives an exten- 
sive overview of the grammars of questions in the fourteen language families of NEA. 
Readers only interested in the typological aspects are advised to skip over this lengthy 
chapter and consult Chapter 6 instead, which gives an overview of the findings of the 
previous chapter, illustrated with several geographical maps inspired by the World Atlas 
of Language Structures (Dryer & Haspelmath 2013). Chapter 7 presents some conclusions, 
sketches possible avenues for further research, and briefly summarizes the tentative idea 
of an ecological typology. Following the extensive list of References, the Appendix lists 
the data that were used for the comparative maps of §6.4. At the end of the book there 
are Name, Language, and Subject Indexes. 
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Northeast Asia 


The validity of all fourteen language families of NEA has been proven by means of 
the classical comparative method. Hammarstróm et al. (2016) list about 430 different 
language families worldwide. Of these, Niger-Congo (called “Atlantic-Congo”, 1430 lan- 
guages) and Austronesian (1274 languages) are, in terms of individual languages, the two 
largest ones. Indo-European (583 languages) and Trans-Himalayan (475 languages) fol- 
low in places three and four. All other families found in NEA are considerably smaller, 
with several dozen languages at most. As regards the size of the individual languages, 
i.e. the number of speakers, there are similarly pronounced differences. By counting na- 
tive speakers only, Mandarin is the largest language worldwide with about one billion 
speakers. English has less than half the number of native speakers, but including second 
language learners, it must clearly be considered the largest language in the world, with 
perhaps up to twice as many speakers as Mandarin. Russian (ca. 150 million, Cubber- 
ley 2002), Japanese (ca. 130 million, Hasegawa 2015), Korean (ca. 75 million, Song 2005), 
Ukrainian (ca. 36 million, Young 2006), Uzbek (ca. 20 million, Johanson 2006b), Kazakh 
(ca. 10 Mio, Muhamedowa 2016), Uyghur (ca. 10 million, Tuohuti Litifu 2012), Mongolian 
(ca. 5 million, Janhunen 2003e), and Amdo Tibetan (ca. 1.3 million, Ebihara 2011: 42), have 
more than one million speakers. Of the rest, only Shuri, Yakut, Oirat, Tuvan, and Buryat, 
and perhaps Santa, have between 200,000 and one million speakers. Most of the remain- 
ing languages have well below fifty thousand speakers. But note that several languages, 
including Mandarin, English, Russian, Ukrainian, Uzbek, and Kazakh, are represented in 
NEA only by a fraction of the total number of speakers. 

The names Paleo-Siberian or Paleo-Asiatic (paleoaziatiskije jaz yki in Russian) are some- 
times still used as labels for several language families (e.g., Tsumagari et al. 2007), espe- 
cially Amuric, Chukotko-Kamchatkan, Yeniseic, and Yukaghiric, sometimes expanded to 
include Ainuic. But this label should be avoided whenever possible, as it does not refer 
to any valid genetic, areal, or typological grouping. 

Ainu, Korean, Nivkh, and sometimes even Japanese, are considered to be linguistic 
isolates that are not related to any other known language. However, the difference be- 
tween a language isolate and a language family is a matter of degree rather than kind. 
Historically, an isolate necessarily is part of a larger stock that has already disappeared, 
or the relationship to other languages is too remote to be detectable. A case in point 
is the language Ket. It is known to be part of the Yeniseic language family, but is its 
sole survivor. Recent years have seen the rise of the so-called Dene-Yeniseian hypothesis, 
which claims a genetic connection between Yeniseic and the Na-Dene languages in North 
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America. Without the historical attestation of now extinct varieties of Yeniseic, neither 
the Yeniseic language family nor its connection to Na-Dene would be known today, and 
Ket would simply count as a linguistic isolate. Japanese is certainly not an isolate, but to- 
gether with the Ryükyüan languages forms the Japonic or Japanese-Ryükyüan language 
family. In addition, Ainu, Korean, Nivkh, and Japanese all have a certain amount of in- 
ternal diversity that is usually described as dialectal variation. Given the absence of any 
clear definition of what characterizes a language as opposed to a dialect, a clear distinc- 
tion between an isolate and a language family cannot be drawn. In order to make the 
description analogous to the other language families, the designation of the language 
families of Ainu, Korean, and Nivkh will be Ainuic, Koreanic, and Amuric (Janhunen 
1996), respectively. 

A special group of Northeast Asian languages is formed by several pidgins, creoles, 
and mixed languages. Their classification is open to debate and depends on the theory 
of genetic relatedness one adopts (Operstein 2015: 1-3). The pidgins, both of which are 
extinct by now, were called Govorka (Taimyr Pidgin Russian, Russian x Nganasan), and 
Chinese Pidgin Russian x Chinese). Both are strongly based on Russian, which is why 
they will be treated together with the other Indo-European languages (882.5, 5.5). Mixed 
languages include Copper Island Aleut (Aleut x Russian) and Eynu (Uyghur x Persian). 
For practical purposes these will be treated together with Eskaleut (882.4, 5.4) and Turkic 
(882.11, 5.11), respectively. An Ainu-Itelmen hybrid will not be included as it is extinct 
and has not been recorded to a sufficient degree (Fortescue 2003: 81). Yilan Creole, the 
only language of Taiwan included in this study, is basically Japanese (882.6, 5.6), but has 
been strongly influenced by Austronesian languages. The status of several varieties in 
the Amdo Sprachbund, especially Gangou, Hezhou, Tangwang, and Wutun (all Sinitic x 
Turkic x Mongolic x Tibetic), remains somewhat unclear. But there are some indications 
that they are creolized varieties of Sinitic and thus will all be treated together with Trans- 
Himalayan (Sino-Tibetan, §§2.9, 5.9). Several languages, including Alchuka, Bala, Kili, 
Kilen, and Ussuri Nanai, are to different degrees a mixture of several Tungusic languages 
and therefore treated in $2.10 and $5.10 on Tungusic. 

The Indo-European languages Latin, Sanskrit, and Prakrit as well as the Semitic lan- 
guages Arabic, Aramaic, and Hebrew, all of which were at some point used as literary 
languages in parts of NEA, will be excluded. The two Indo-European languages Dutch 
and Portuguese had only a short-lived and, at least for the purposes of this study, unim- 
portant presence in the maritime southeast of NEA. Today, globalization brings many 
different languages from all around the world into NEA, especially the larger cities in 
the south. But apart from English, these languages will be neglected, too. NEA may have 
been home to languages and whole language families that have disappeared without 
leaving any records. Some of them may be accessible through the study of loanwords. 
A case in point is the hypothetical language of the Rouran empire ($A, 330-555 CE) 
around Mongolia, for which Vovin (2004) has collected a small amount of material. He 
concludes that it is probably not related to any surrounding language known to us to- 
day. Unfortunately, almost nothing is known about its grammatical structure, let alone 
its grammar of questions. Another language or family of languages that apparently has 


16 


disappeared without trace (Fortescue 2013) was presumably spoken by the recently dis- 
covered Paleo-Eskimos. 


Paleo-Eskimos likely represent a single migration pulse into the Americas from 
Siberia, separate from the ones giving rise to the Inuit and other Native Ameri- 
cans, including Athabaskan speakers. Paleo-Eskimos, despite showing cultural dif- 
ferences across time and space, constituted a single population displaying genetic 
continuity for more than 4000 years. On the contrary, the Thule people, ancestors 
of contemporary Inuit, represent a population replacement of the Paleo-Eskimos 
that occurred less than 700 years ago. (Raghavan, DeGiorgio, et al. 2014: 1020) 


This is by no means the only prehistoric population that is attested in NEA, but the re- 
cency of their spread would in principle make them accessible with the standard tools 
of historical linguistics. Recently, genetic studies came to the conclusion that not only 
populations in Chukotka, but also Kets, Nganasans, Selkups, Yukaghirs (Flegontov et al. 
2016), and speakers of Eskaleut and Na-Dene languages (Reich 2018: 175, 183) are genet- 
ically related to the Paleo-Eskimos. It would be tempting to connect this evidence with 
the Dene-Yeniseian hypothesis (Vajda 2010), but thus far we cannot bring together the 
linguistic and genetic data as there are too many possible variables. It has by now been 
demonstrated that not only the Paleo-Eskimos, but in fact all native American popula- 
tions can be traced back to Asia. In other words, all extant and innumerable extinct in- 
digenous American languages necessarily have their origin in NEA in prehistoric times. 
The so-called Beringian Standstill Model assumes that a population had lived relatively 
isolated in Beringia, now mostly covered by water, before entering the Americas when 
the glaciers were on their retreat and the sea levels started to rise (e.g., Moreno-Mayar 
et al. 2018). Llamas et al. (2016: 1), based on genetic evidence, recently argued “that a 
small population entered the Americas via a coastal route around 16.0 kya, following 
previous isolation in eastern Beringia for ~2.4 to 9 thousand years after separation from 
eastern Siberian populations.” (corrected) In other words, the predecessors of most na- 
tive American languages—possibly excluding speakers of Na-Dene, hypothetical Paleo- 
Eskimo, and Eskaleut, all of which spread over North America much later—were still 
around in Beringia, arguably a part of NEA back then, as recently as 16,000 years ago. It 
is plausible to assume that this Beringian area harbored a certain amount of linguistic 
and genetic diversity. For example, there is evidence for a population that today only 
left some genetic traces in Amazonia and is more closely related to Australasians (see 
Reich 2018: 176-181 and references therein). This time depth of up to 24,000 years of sep- 
aration of Siberian and these early native American populations lies well beyond the 
perhaps 10,000 or so years that are, given ideal circumstances, accessible by means of 
the comparative method. This means that, from a purely linguistic point of view, gener- 
ally only a fraction of prehistory, namely the Holocene (from ca. 9,500 BCE, Bellwood 
2013: 5f.), is actually accessible. Even so, the age of most language families in NEA is 
considerably lower and does not even approach that age. The data in Table 2.1 are only 
approximations and different authors give different estimates. The data quoted were cho- 
sen because their point of view seems to be by and large the most accurate according to 
my current understanding. 
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2 An overview of language families in Northeast Asia 


Table 2.1: Approximate rounded age and homeland of the 14 language families; 
arrows indicate the possible location of the pre-proto languages 


Family Estim. age Location Source 
Trans-Himalayan ?9000-8000 ?eastern Himalayas Blench & Post 2014 
Indo-European 6500 north of Black Sea Anthony & Ringe 
2015 
Eskaleut 5000 Chukotka — Fortescue 2013 
Southern Alaska 
Uralic 5000 western Siberia Janhunen 2009 
Chukotko-Kamchatkan 4000 Kamchatkan Fortescue 2005 
isthmus 
Japonic 2200 southwest Korea —> Janhunen 2010; 
Honshü Sean & Toshikazu 
2011 
Turkic 2200 south Siberia, Yunusbayev et al. 
Mongolia 2015 
Tungusic 2000 middle Amur Pevnov 2012 
Yukaghiric 22000 Baikal — middle Hakkinen 2012; 
Lena, Indigirka Maslova 2003a 
Yeniseic 1500-2200 northern China —> Vajda 2004; Vovin 
south Siberia et al. 2016 
Ainuic 1300 Honshü — Sean & Hasegawa 
Hokkaido 2013 
Mongolic 800 northeastern Janhunen 2003d 
Mongolia 
Amuric 500-1500 Upper Amur Fortescue 2011; 
Janhunen 2010 
Koreanic 500-1000 southeast Korea Janhunen 2010 


§2.1 to 82.14 will briefly introduce all 14 language families of NEA in alphabetical order. 
Details of the internal classification of the language families, as well as their grammars 


of questions, will be described in Chapter 5. 
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2.1 Ainuic 


2.1 Ainuic 


Bugaeva (2012: 463) estimates that there are about 100,000 ethnic Ainu, of whom only a 
handful still speaks the language. Historically, there are three major groups of dialects, 
the Sakhalin dialects, the Kuril Islands dialects, and the Hokkaidodialects (e.g., Bugaeva 
2012: 461). Proto-Ainuic has roughly been dated “to the last centuries of the first mil- 
lenium A.D? (Vovin 1993: 155). The spread of the three branches probably started in 
northern Hokkaido (Sean & Hasegawa 2013) and covered a vast area reaching Sakhalin 
in the Northwest and the Kuril Islands and maybe even the tip of southern Kamchatka in 
the Northeast. Today, most Ainu have shifted to Japanese and the last speakers are only 
found on Hokkaido. Most of the Sakhalin Ainu moved to Japan after the Second World 
War and the Kuril Island Ainu were relocated as early as 1884. Both groups of dialects 
are extinct today. Genetic research has revealed that the Ainu are the result of an ad- 
mixture from the continental Okhotsk people (perhaps connected to the Nivkh) into the 
Satsumon population, which itself goes back to the Jomon population (Takehiro et al. 
2007). It is known through the study of place names in the Tohoku region of Honshü 
that speakers of Ainu or a language closely related to Ainu once must have lived there 
as well. According to Bentley (2008b: 33), Chinese recordings of Yamatai toponyms, pre- 
sumably located in southern Japan, are predominantly Japanese, but may also contain 
several Ainuic elements. The most likely scenario that also takes recent genetic stud- 
ies into consideration (Jinam et al. 2012), is that the Ainu, because of the arrival of the 
Japonic-speaking Yayoi people in Honshi, migrated from Honsht to Hokkaido, where 
they mixed with people from the Amuric speaking Okhotsk population, but preserved 
their language and subsequently spread to the surrounding regions (Sean & Hasegawa 
2013: 5). Up to this point in time, no genetic connections of Ainuic with other languages 
or language families have been proven. The best but still not absolutely convincing at- 
tempt to clarify the prehistory of the Ainu language has perhaps been made by Vovin 
(1993: 175), who could “definitely say that Proto-Ainu is unrelated to any of the neigh- 
bouring languages.” He proposed a possible connection with Austroasiatic but this is not 
generally accepted. Hirofumi & Oxenham (2013: 219) summarized research on the origin 
of the Jomon population and concluded “that it ultimately derived from the modern 
human colonizers of Late Pleistocene Southeast Asia and Australia, who subsequently 
mixed with later migrants from the northern part of East Asia during the early Jomon 
period (c. 12-7 kya) or before”. This would be in accordance with Vovin’s claim of a south- 
ern origin, but given the great time depth of the Jomon culture of 12 ky and the extremely 
shallow time depth of Ainuic, no further hypothesis can be drawn on possible linguistic 
connections. For the time being, Ainuic has to be recognized as a stock on its own, but 
with possible connections to Mainland Southeast Asia and beyond. 

The contact languages of Ainuic were Japonic in the South, and Amuric in the North 
(e.g., Vovin 2016). There is also strong contact to Russian as well as the Tungusic language 
Uilta on Sakhalin and, on the southern tip of Kamchatka, to Itelmen. Ainu used to be a 
lingua franca in southern Sakhalin during the 19th century, and was even used by the 
Japanese (Yamada 2010: 65). 
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2.2 Amuric (Nivkh) 


The designation Amuric has been introduced by Janhunen (1996) to refer to the lan- 
guage family to which Nivkh, previously called Gilyak, belongs. The internal diversity 
appears to be similar to that of Ainuic, with some dialects being mutually unintelligible 
(Gruzdeva 1998: 7). No relation with other languages has been proven, although Fortes- 
cue (2011) recently argued for the possibility of a remote relationship with Chukotko- 
Kamchatkan languages, which has yet to be verified. There are at most a few hundred 
speakers left out of a population of a few thousand. Amuric has often been linked with 
the Okhotsk culture (5th to 13th century AD), which reached as far as Hokkaido and the 
Kuril Islands (Fortescue 2011) and had a strong impact on the Ainu (see also Vovin 2016). 
Based on evidence from the cultural lexicon, Janhunen (2010: 294) assumes an origin of 
Amuric further to the south in central Manchuria. However, this contradicts both the 
assumption that Tungusic was spoken along the middle Amur (§2.10) and the hypoth- 
esis that the Okhotsk culture was Amuric-speaking. Today, Nivkh is spoken along the 
mouth of the Amur and in some villages on Sakhalin and perhaps by a few speakers who 
were resettled in Hokkaido after the last world war (Fortescue 2016: 1ff.). 

Nivkh had intense contacts with several Tungusic languages (e.g., Gusev 2015b) both 
at the lower Amur (e.g., Negidal, Ulcha), and on Sakhalin (Uilta, Sakhalin Evenki), where 
there was also contact with Ainuic and, for a short period, with Japanese (see also Ya- 
mada 2010). In addition, there is some evidence for old contacts between Amuric and 
Ainuic (see Vovin 2016). The most important contact language today is Russian, and 
most Nivkh have switched to speaking Russian. 


2.3 Chukotko-Kamchatkan 


The status of Chukotko-Kamchatkan (or Luoravetlan) as a language family is not rec- 
ognized by some authors, notably Georg & Volodin (1999). But Fortescue (2003; 2005; 
2011) has quite convincingly shown that it has a firm basis. The language family falls 
into two major branches, Itelmen (Kamchadal) on the one hand and a more diverse 
branch including Chuckchi, Alutor, Koryak (Nymylan), and Kerek, on the other hand. 
All scholars agree that Chuckhi, Alutor, Koryak, and Kerek are related, and the contro- 
versy surrounds the question of whether Itelmen belongs to the same language family or 
not. Concerning the origin of Chukotko-Kamchatkan (CK), Fortescue (2005: 3) assumes 
the following scenario. 


The linguistic “centre of gravity’—suggesting the original CK “homeland”—lies 
around the Kamchatkan isthmus [...], an area presumably reached from the west 
along the coast of the Okhotsk Sea long before the introduction of the reindeer- 
herding from further west within the last thousand years or so [...]. The time at 
which proto-CK may have been spoken in this general area by hunters of wild 
caribou has been estimated as somewhere around four thousand years [...]; this co- 
incides with the beginnings of the Neolothic cultures of Tarya on Kamchatka and 
(a little later) Ust-Belaya on Chukotka. 
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2.4 Eskaleut (Eskimo-Aleut) 


In agreement with an original location further to the west and perhaps to the south, 
Fortescue (2011) has recently argued for an old genetic relation of Chukotko-Kamchatkan 
with Amuric, which seems possible but remains to be verified. A recent genomic study 
has shown that the Chukchi derive about 40% of their genome from a back-migration of a 
native American population to Asia (Reich 2018: 184). Ifthe same is true for all Chukotko- 
Kamchatkan-speaking populations, this opens up the possibility that Pre-Proto-Chukot- 
ko-Kamchatkan, or a contact language thereof, can be traced to North America. 

Two historically attested dialects of Itelmen as well as Kerek have already disappeared, 
and all the remaining languages except for Chukchi, which has about 10,000 speakers, are 
highly endangered. Concerning the lifestyle of the speakers of Chukotko-Kamchatkan, 
Anderson (2006a: 416) mentions an interesting split. 


Along the coasts, Chukchi people live as sea mammal hunters, like the local Yup’ik 
populations, but they live as reindeer herders in the interior. Approximately three- 
quarters of the Chukchi live as reindeer herders. Northern Kamchatkan groups 
mainly practice reindeer-oriented economies and fishing and sea mammal hunting 
along the coasts. The Itelmen live primarily as subsistence fishers. 


The herding of reindeer must be a relatively recent innovation brought to the North- 
east of NEA by other people from the west, but may have been the driving factor in a 
secondary expansion of Chukchi. 

Chukotko-Kamchatkan languages had contact mostly with Even, parts of Yupik, Yuk- 
aghiric, Russian and, less importantly, English. Itelmen seems to have had contact with 
Ainuic as well. 


2.4 Eskaleut (Eskimo-Aleut) 


Eskaleut languages are for the most part not spoken in NEA, but in Alaska, Canada, 
and Greenland (e.g., Berge 2006). The primary split is between Eskimo and Aleut, the 
former having an additional division between Yupik, Inuit, and perhaps Sirenikski (e.g., 
Fortescue et al. 2010: x). In this study only those Eskaleut languages spoken in or in the 
vicinity of NEA will be included. These are Sirenik(ski), which is extinct, and Naukan(ski) 
Yupik on the mainland, Central Siberian Yupik on St. Lawrence Island, and Aleut as 
well as Mednyj Aleut on the Aleut Islands. The languages have all reached their present 
location from Alaska, where the homeland of Eskaleut was probably located. Very early, 
at least several thousand years ago, the Aleut started migrating along the Aleut islands 
towards Asia. 


It can only be surmised that the movement that separated Aleut from Eskimo oc- 
curred soon after the first arrival of the Eskimo-Aleut family in Alaska over Bering 
Strait, at least four thousand years ago and some two thousand years before the 
Inuit-Yupik split. The linguistic evidence suggests at least two major phases here 
—an ongoing spread westwards as far as the outermost Near Islands (reached some 
2,500 years ago), overlaid in more recent times (only a few hundred years ago) by 
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a wave bearing specifically Eastern Aleut influence from the Alaskan peninsula. 
(Fortescue 2013: 344f.) 


The best known and most important expansion of Eskimo was about a thousand years 
ago to northern Canada and Greenland. But there were migrations on the Asian side as 
well, which are more important for the present study (Fortescue 2004). 


On the Asian side of Bering Strait, at approximately the same time as the Thule 
migration eastward from North Alaska, a westward expansion of Punuk culture 
whaling people probably speaking Central Siberian Yupik was initiated. This even- 
tually reached as far as the Kamchatkan isthmus in the 15th century, as linguistic 
evidence suggests, although the Eskimo presence must have been short-lived or 
absorbed by maritime Koryaks and—especially—Kereks (Fortescue 2013: 344) 


It is, of course, generally accepted that Pre-Proto-Eskaleut had been located on the 
Asian side before crossing over to Alaska, but according to Berge (2010: 558) and Fortes- 
cue (2013) this must have been at least 4000 years ago. The possible existence of a few 
Eskimo loanwords in Tungusic languages cannot change that basic fact (cf. Vovin 2015). 

Fortescue (2013: 344) hypothesizes that Sirenikski may be “a pocket of archaic Eskimo 
much influenced by Chukchi” Aleut probably had contact with unknown languages in 
Alaska and perhaps the Aleut Islands. Both Aleut and Yupik as spoken in Asia had strong 
contact with Russian and, less importantly, with English. 


2.5 Indo-European 


Indo-European is the most widespread and the largest language family worldwide in 
terms of speakers. About one third of the global population speaks an Indo-European 
language. Proto-Indo-European was presumably located on the Pontic-Caspian steppe, 
perhaps about 4500 BCE (Anthony & Ringe 2015), although there are competing but 
in my eyes much less likely hypotheses, for example of a location in Anatolia south 
of the Black Sea (e.g., Heggarty 2013). There is convergent evidence from the human 
genome, archaeology, and linguistics for the location on the Pontic-Caspian steppe (e.g., 
Anthony 2007; Allentoft et al. 2015; Anthony & Ringe 2015; Haak et al. 2015; Jones et al. 
2015). According to one prominent view, the subsequent spread and the divergence of 
Indo-European branches can be summarized as follows: 


Archaic Proto-Indo-European (partly preserved in Anatolian) probably was spo- 
ken before 4000 BCE; early Proto-Indo-European (partly preserved in Tochar- 
ian) was spoken between 4000 and 3500 BCE; and late Proto-Indo-European (the 
source of Italic and Celtic with the wagon/wheel vocabulary) was spoken about 
3500-3000 BCE. Pre-Germanic split away from the western edge of late Proto- 
Indo-European dialects about 3300 BCE, and Pre-Greek split away about 2500 BCE, 
probably from a different set of dialects. Pre-Baltic split away from Pre-Slavic and 
other northwestern dialects about 2500 BCE. Pre-Indo-Iranian developed from a 
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2.5 Indo-European 


northeastern set of dialects between 2500 and 2200 BCE. (Anthony 2007: 82, my 
boldface) 


Indo-European has a dozen major branches, four of which have, or formerly had, rep- 
resentatives in Northeast Asia as defined here: Tocharian, Iranian (part of Indo-Iranian), 
(East) Slavic, and (West) Germanic. Historically speaking, Indo-European languages en- 
tered Northeast Asia at roughly three different times. 

Pre-Tocharian, which may have branched off from Indo-European about 5300 years 
ago (before all other branches except Anatolian), probably reached the Altai mountains 
shortly afterwards and is associated with the Afanasievo culture (ca. 3300-2500 BCE) 
(Mallory 2010: 51; Anthony & Ringe 2015: 208). The Afanasievo culture showed a south- 
ward expansion, which would explain why Tocharian is only attested further south in 
the Tarim basin in two different forms known as Tocharian A (East) and B (West) (e.g., 
Winter 1998). There are indications of the existence of a third language (Tocharian C), 
which is attested exclusively in loanwords (Mallory 2010: 48f.). Tocharian has been ex- 
tinct for at least a thousand years. 


Tocharian A, found in documents near Turfan and Qarasháhár, and Tocharian B, 
found mainly around Kucha in the west but also in the same territory as Tocharian 
A. The documents, dating from the 6th to the 8th centuries CE, suggest that Tochar- 
ian A was by that time probably a dead liturgical language, while Tocharian B was 
still very much in use. In addition to Tocharian, administrative texts have been dis- 
covered in Prakrit, an Indian language from the territory of Kroran [lóulán f]; 
these documents contain many proper names and items of vocabulary that would 
appear to be borrowed from a form of Tocharian (sometimes known as Tocharian 
C) spoken by the native population. The Kroránian documents date to ca. 300 CE 
and provide our earliest evidence for the use of Tocharian. For our purposes here, 
it is also very important to note that the earliest evidence for the mummified re- 
mains of “westerners” in the Tarim Basin is found in cemeteries at Xiaohe [/] V] 
(Small River) and Qàwrighul [güámügou 772274], both of which are located in the 
same region as Tocharian C. (Mallory 2010: 48f., my square brackets) 


There are alternative names for Tocharian A, such as Agnean after the Sanskrit name 
Agni (yānqí SS) for the city of Karashahr, and for Tocharian B, such as Kuchean after 
the city of Kucha (qiüzi f&2& and variants) (e.g., Fortson 2010: 400; Geng Shimin 2012). 

Tocharian was in contact with several Iranian languages that entered the Northeast 
Asian scene after Tocharian, but were probably present in the Tarim basin as early as 
1300 BCE (Mallory 2010: 50). Iranian together with Indic and maybe Nuristani as an in- 
dependent subbranch, forms the Indo-Iranian branch of Indo-European (Fortson 2010: 
202f.). Iranian language history is usually divided into an Old Iranian (until the 4th or 
3rd century BCE), a Middle Iranian (until the 8th or 9th century CE), and a Modern Ira- 
nian period (e.g., Schmitt 2000: 3). Iranian languages only had a wide distribution in 
NEA during the Middle Iranian period. The two languages Khotanese (hétián sáiyd $N 
FH SÉ iE, in the South of the Tarim basin, ca. 5th to 10th century CE, Emmerick 2009: 
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377ff.) and Tumshuquese (tumushuke sàiyù | Stop in the North, 7th to 8th cen- 
tury CE), closely related and usually collectively called Saka (sáiyü 32), were more re- 
stricted in their distribution than Sogdian (Emmerick 2009; Geng Shimin 2011). Sogdian 
(sütéydü IFE, ca. 4th to 11th centuries CE, Yoshida 2009: 279ff.) was originally spoken 
in present-day Uzbekistan and Tajikistan, but “the Sogdians played an active role as in- 
ternational traders along the Silk Road between China and the West, with the result that 
the Sogdian language became a kind of lingua franca in the region between Sogdiana 
and China” (Yoshida 2009: 279). Regarding modern Iranian, only the Pamir languages 
Sarikoli (sálikü'er B® Fi ER) and Wakhi (wáhán FL), treated as dialects of one lan- 
guage called tajikéyu 347% 5218 (Gao Erqiang 1985: 101) but not to be confused with 
the Tajik language, as well as the mixed Persian-Uyghur language Eynu, are spoken in 
NEA. However, the discussion will also briefly mention Yaghnobi, which is located in 
Tajikistan but represents the only modern language that is closely related with Sogdian. 

The last period of Indo-European influx brought Eastern Slavic as well as Germanic 
languages into NEA. Together with the Baltic languages, Slavic forms the Balto-Slavic 
branch of Indo-European. Only the East Slavic languages Russian and Ukrainian ex- 
panded into NEA. Russian is not only the dominant language of the Russian Federation, 
but has also had some influence on several languages outside of Russia, such as Mongo- 
lian or Uyghur. Many speakers of languages in the Russian Federation are bilingual in 
Russian or are even shifting to Russian as their primary language. Ukrainian only plays 
a marginal role, but nevertheless can be found scattered across the Russian-speaking 
area. Slavic originates in Eastern Europe, perhaps northwest of the Black Sea (Fortson 
2010: 420f.) and the Russian expansion beyond the Urals only started in the 16th century. 
By 1625 the Russians reached the Yenisei, and by the end of the 17th century they had 
conquered most of Siberia, excluding only Outer Manchuria, Chukotka, and southern 
Kamchatka (Forsyth 1992: 102). This means that Russian played no role in NEA until 
about 400 years ago. There is a mixed Russian-Ukrainian language called Surzhyk, of 
which some speakers are most likely also found in NEA, but which must be neglected 
for lack of sufficient information (Bilaniuk 2004). 

Only West Germanic languages are marginally represented in NEA by scattered mi- 
norities of German (especially Altai Low German) speakers living in southern Siberia as 
well as a certain amount of influence from American English as spoken in Alaska and 
the Aleut Islands. Yiddish is included here mostly because of the existence of a Jewish 
Autonomous Oblast in Russia close to Khabarovsk, where a handful of Yiddish speakers 
can be found and where it has an official status. Yiddish is a descendant of primarily 
southeastern Middle High German that was extensively influenced by Slavic, Hebrew, 
and Aramaic (Jacobs et al. 1994). Altai Low German (or Plautdiitsch) *is the descendant 
ofthe Low German (Low Prussian and Pommeranian) dialects once spoken in the Danzig 
area.” (Nieuweboer 1999: 13) There is only limited information on questions in Altai Low 
German, but Standard German, a liturgical language for Siberian speakers of German di- 
alects, can give some rough indications about how the blanks may be filled in. There was 
an English jargon introduced with English-speaking whale hunting crews especially in 
Chukotka (de Reuse 1996). English is perhaps the major foreign language in large parts 
of NEA and there are many native speakers, often soldiers, in Japan and South Korea. 
Furthermore, it often serves as a lingua franca in international communication. 
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2.6 Japonic (Japanese-Ryükyüan) 


The Japonic language family most likely had its origin on the Korean Peninsula and 
only later expanded into the Japanese archipelago. This expansion is connected with the 
Yayoi people, originally perhaps farmers along the Yangtze, who after 850 BCE via Ko- 
rea spread to Japan where they arrived by about 400 BCE (e.g., Janhunen 2003a; Sean & 
Toshikazu 2011; Hirofumi & Oxenham 2013: 219; Siska et al. 2017: 2f.). The Yayoi people 
mixed with and replaced the original Jomon population, their hunter-gatherer lifestyle 
as well as their languages. Peripheral areas such as Hokkaido and the Ryükyüan Is- 
lands preserve stronger traces of the Jomon genome. But while Ainuic languages in 
Hokkaido may represent the last remnants of the Jomon languages, Ryükyüan languages 
are clearly related to Japanese. According to Vovin (2013b: 202), the southward migration 
of Ryükyüan only started in the 9th century. 

According to one classification, Japanese can be divided into Old (592-794), Late Old 
(794-1192), Middle (1192-1603), and Early Modern Japanese (1603-1867) (Hasegawa 2015: 
5ff.). Old Japanese can be further divided into Eastern, Central, and Western Old Japanese. 
Eastern Old Japanese was spoken in what today is the Kanto area in the 8th century CE, 
while Western Old Japanese is the language from Nara (Kupchik 2011). Hachijo is the 
only modern descendant of Eastern Old Japanese (Kupchik 2011: 9). Central Old Japanese, 
thought to be the predecessor of Modern Japanese, is almost unknown (but see Kupchik 
2011: 7f., 852). Old Japanese has to be distinguished from Classical Japanese, which was 
based on Late Old Japanese as defined above and served as a literary language (Tranter 
20122). There is evidence for the former presence of Para-Japonic or Japonic languages on 
the Korean Peninsula as well as on Jeju Island (Vovin 20132), but no information relevant 
for this study can be obtained from these long-gone varieties (see also Beckwith 2007 
and especially Pellard 2005 for some discussion). 

Japonic had contact with Ainuic, Koreanic, Sinitic, Amuric, Uilta etc. Modern Japa- 
nese, furthermore, has been influenced by several European languages and especially 
English. Contact with Austronesian on Taiwan led to the emergence of Yilan Creole. 
The dialects of Japanese as well as Ryükyüan languages are both increasingly being re- 
placed by Standard Japanese, which itself is based on the Tokyo dialect in the Eastern 
dialect area (Sanada & Uemura 2007). Yilan Creole is under Chinese influence. 


2.7 Koreanic 


The internal dialectal differences of Korean should not be underestimated, and some of 
these dialects, notably Jeju on Jeju island and Yukcin in the Northeast, have been said to 
exhibit language-like differences with regard to other varieties of Korean. It is therefore 
possible to speak of the Koreanic language family instead of a Korean isolate. Regarding 
the origin of Koreanic, Vovin (2013b: 201) has recently argued for a location in the north: 


It appears that the migration of the Korean[ic] speakers to their present location 
was quite straightforward, from southern Manchuria in the north to the Korean 
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Peninsula in the south. The linguistic process of Koreanization took several cen- 
turies, and it appears that proto-Korean[ic] or pre-Old Korean gradually replaced 
[Para-]Japonic languages between the 3rd and 8th centuries cr. The central and 
southern parts of the Korean Peninsula were originally [Para-]Japonic speaking. 
(my square brackets) 


Today, Koreanic is distributed across the entire Korean Peninsula as well as adjacent 
parts of China, parts of Sakhalin, and even Central Asia. Theoretically, Central Asian 
Korean (Kolyemal) as spoken in eastern Uzbekistan, for example, is located outside of 
Northeast Asia. However, given its location very close to Xinjiang and the fact that it 
preserves several conservative features that were lost in Korea, it will also be included. 

Korean is historically attested in several stages that may be called Old Korean, Middle 
Korean, and Modern Korean, but recent descriptions disagree on how exactly the histori- 
cal stages of Korean should be classified. Whitman (2015) considers Old Korean to be the 
language of Unified Silla (668-935 CE), while Nam (2012: 41) argues that the Old Korean 
period already began in the 59^ century CE. 


We divide Old Korean (OK) into Early, Mid and Late Old Korean (EOK, MOK, LOK). 
EOK was the Korean of the Three Kingdoms period, roughly from the start of the 
fifth century until Silla unified the Three Kingdoms in the 660s. MOK was the Ko- 
rean of the Unified Silla [Sinla] period, from the 660s until the 930s when Koryó 
[Kolye] re-unified the country. LOK was the language of the earlier part of the 
Koryó dynasty from the 930s till the mid-thirteenth century. 


The languages that were spoken before or during Unified Silla are only poorly attested. 
Very likely these languages included Para-Koreanic and Para-Japonic, but no relevant 
material is available for the purposes of this study, which is why they have been excluded 
here altogether. Old Korean was followed by Middle Korean, more exactly Early Middle 
Korean (10^ to 14™ centuries) and Late Middle Korean (15^ and 16" centuries), roughly 
divided by the invention of the Hangul script in 1446 (Sohn 2012). 

Koreanic had contact with Southern Tungusic, Japonic, and Sinitic, which forms a 
very strong ad- and superstrate. Both Japonic and Koreanic derive a large amount of 
vocabulary from Sinitic. Today, English is an important contact language as well. 


2.8 (Khitano-)Mongolic 


There are a dozen Mongolic languages and all are spoken in Northeast Asia except for 
Kalmyk (an aberrant dialect of Oirat) and Moghol in Afghanistan (Janhunen 2003e, 2006). 
Apart from the Mongolic languages proper, there is what has been termed Para-Mongolic 
(Janhunen 2003c; 2012a), i.e. sister languages of the Proto-Mongolic lineage (e.g., Khi- 
tan). All known Para-Mongolic languages are extinct and given the scarce material, Para- 
Mongolic languages will be excluded from the discussion. The age of the Mongolic lan- 
guage family, i.e. the time of the break-up of the Proto-Mongolic unity, is thought to be 
only about 800 years (e.g., Janhunen 2012b: 3). If one includes Para-Mongolic, the family 
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must be much older, but Janhunen’s (2012d: 8) estimate of an age of about 1500 to 2500 
years before present shows that the details are far from clear. In addition to the modern 
Mongolic languages there are historical records of older stages, notably so-called Middle 
Mongol, which “is the technical term for the Mongolic languages recorded in documents 
during, or immediately after, the time of the Mongol empire(s), in the thirteenth to the 
early fifteenth centuries.” (Rybatzki 2003b: 57) In addition, there is written Mongol, a 
literary language written with the Uyghur alphabet that has a history of about 800 years 
and exhibits several archaic features (Janhunen 2003f). The recently partly deciphered 
Huis Tolgoi inscription from Mongolia seems to represent a form of early Mongolic and 
is considerably older than Middle Mongol (e.g., Vovin 2017). The “homeland” problem is 
notoriously difficult for many language families. However, for Mongolic it quite clearly 
was located somewhere in present-day northeastern Mongolia, the place where the Mon- 
golic expansion had its starting point (Janhunen 2003e: xxxiv). But Proto-Mongolic itself 
formed a larger family with Para-Mongolic, and the question about the original location 
of this proto-language of Proto- and Para-Mongolic (Janhunen 2012a: 114 proposes the 
name Khitano-Mongolic, also adopted here, and Shimunek 2014; 2017 Serbi-Mongolic), is 
less easy to answer. Janhunen (2012d: 10) assumes that it was located further to the south 
in present-day Liaoning or eastern Inner Mongolia: 


There is a particularly clear parallelism in the expansion of the Mongolic [includ- 
ing Para-Mongolic] and Tungusic language families. Once they had occupied their 
protohistorical positions on both sides of the Liao basin, they both assumed a gen- 
eral northward trend of expansion. In the light of the available information on the 
history and protohistory of the region, the Mongolic homeland has to be placed in 
southwestern Manchuria (Liaoxi), while the Tungusic Homeland can hardly have 
been located anywhere else but in southeastern Manchuria (Liaodong), though 
quite possibly also extending to the northern part of the Korean Peninsula. (my 
square brackets) 


On Tungusic, see $2.10. Janhunen's assumption of a Pre-Proto-Mongolic homeland situ- 
ated roughly in eastern Manchuria is corroborated by some historical facts, such as the 
Khitan Liao-dynasty (L, 916-1125 CE) that roughly derived from this region. 

Mongolic in general shows strong influence from Turkic languages and vice versa 
(Schónig 2003). Individual Mongolic languages participated in different linguistic areas 
that sometimes overlap and display a different strength of convergence. Shirongolic is 
an integral part of the so-called Amdo Sprachbund. Dagur, together with the two Tungu- 
sic languages Solon and Oroqen, formed a small linguistic area for itself, but during the 
Qing-dynasty (1636-1911) were also under the strong influence of yet another Tungusic 
language, Manchu. Similar to Tungusic, Mongolic languages today can be classified as to 
whether they are under the influence ofthe national language of Russia (Kalmyk, Buryat) 
or China (Dagur, Shirongolic etc.). But unlike Tungusic, this only partly applies to the 
Mongolic languages spoken in "Outer Mongolia", where Russian influence appears to be 
receding, and does not apply at all to Moghol in Afghanistan. A national language itself, 
Mongolian of course influences all Mongolic languages spoken in Mongolia. 
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2.9 Trans-Himalayan (Sino-Tibetan) 


It has been pointed out that the name Sino-Tibetan is somewhat misleading and it will not 
be used in this book. The traditional view, as advocated by LaPolla (2013), for example, 
claims that Sino-Tibetan has two main branches, Sinitic and Tibeto-Burman. According 
to this view, the origin of Sino-Tibetan (and not only of Sinitic) is usually said to have 
been around the Yellow River. Some of the justified criticism to previous approaches to 
the family has been aptly summarized by Blench & Post (2014: 93): 


"Reconstructions" have been proposed which have failed to take many languages of 
high phyletic significance into account; these forms have been repeatedly quoted 
without remark in the literature, in the process gaining a lustre they hardly de- 
serve. Sino-Tibetan has no agreed internal structure, and yet its advocates have 
been happy to propose dates for its origin, expansion and homeland in stark contra- 
diction to the known archaeological evidence. A focus on “high cultures" (Chinese, 
Tibetan, Burmese) has led to an emphasis on these languages and their written 
records, something wholly inappropriate for a phylum where an overwhelming 
proportion of its members speak unwritten languages. 


Therefore, the more adequate and neutral name Trans-Himalayan (van Driem 2014) will 
be employed here instead, which does not imply a split into only two main branches and 
suggests an origin and center of diversity further to the southwest. In fact, most Trans- 
Himalayan languages are located in South or Southeast Asia. According to van Driem 
(2014) and Blench & Post (2014), the geographical distribution of the different branches 
suggests an origin of the whole language family in the eastern Himalayas. Under this 
assumption, Sinitic would be the northernmost of many different branches of the family. 
Needless to say, this innovative view is not yet accepted by all researchers and deserves 
further investigation (see LaPolla 2016 for a discussion). 

This study only includes languages from three of a total of perhaps 42 different sub- 
branches of Trans-Himalayan (van Driem 2014), namely Sinitic, Tibetic (a subbranch 
of Bodish), and Qiangic. The age of Sinitic depends on the definition. Traditionally, old 
stages of Chinese are divided into Old Chinese and Middle Chinese. However, a new ap- 
proach developed by Norman (2014), which focuses on evidence from the spoken lan- 
guages, makes a distinction into Common Dialectal Chinese (CDC, the proto-language of 
all modern Chinese languages except Min) and Early Chinese (EC, the proto-language of 
Min, CDC etc.). Roughly speaking, CDC can be compared with the Romance languages 
and Early Chinese with Italic. If Sinitic refers to CDC and its descendants, then the age 
is perhaps about 2000 years. If, however, Sinitic refers to the whole branch of Trans- 
Himalayan (i.e., (pre-)EC), then Sinitic is perhaps some 1500 years older. The latter view 
will be adopted here. However, Norman was reluctant to estimate the ages of the two 
proto-languages. While Norman's is perhaps the best approach to the history of Chinese 
yet, this study necessarily takes a pragmatic stance. Compared with Indo-European, the 
reconstruction of Chinese is still in its infancy and goes beyond the possibilities of this 
study, which will mostly be focusing on modern Chinese languages. In order to cap- 
ture some of the history of Chinese, I will refer to the recent study by Baxter & Sagart 
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(2014a,b), who employ the term Old Chinese as a more or less useful cover term for the 
earlier period of Sinitic: 


We use the term “Old Chinese’ in a broad sense to refer to varieties of Chinese used 
before the unification of China under the Qin 2 dynasty in 221 Bce. The earliest 
written records in Chinese are oracular inscriptions on bones and shells from about 
1250 BCE (in the late Shang fj dynasty, which was overthrown by the Zhou JË in 
1045 BCE), so this is an interval of about 1,000 years. Obviously there must have 
been many varieties of Chinese during this period, widely distributed in time and 
space. (Baxter & Sagart 2014a: 1) 


Throughout its history, Sinitic had intense language contacts with many surrounding 
languages (see Matthews 2010). Especially intense was the influence on Korean and Jap- 
anese, which derive a large amount of their vocabulary from Sinitic. Mandarin today is 
the dominant language of China, and has already started to replace several minority lan- 
guages throughout the country. Just like Russian dominates the northern half of NEA, 
Mandarin has a leading position in the southern half. 

Following Tournadre (2014), it is perhaps best not to speak of the Tibetan, but of the 
Tibetic, branch, which goes back to Old Tibetan (ca. 7th to 9th century CE) as its proto- 
language, which is closely related to the Classical Tibetan language: 


‘Classical Tibetan’ is an idealization, referring both to over a millennium of writ- 
ten history and to a tradition of prescriptive grammar which many of the authors 
of the texts, in some cases down to the present, made greater or lesser efforts to 
conform to. [...] The term ‘Old Tibetan’ is used to refer to written material from be- 
fore about 1000 CE, primarily inscriptions and documents found in the Dun-huang 
caves (DeLancey 2003: 255f.) 


Today Tibetic encompasses about 200 different varieties distributed over an extremely 
large area, which can, according to Tournadre (2005), be classified into eight “sections”. 
Only some varieties from the eastern (Baima, Cone, Zhongu) and northeastern sections 
(Amdo Tibetan, gSerpa) will be included here. Amdo Tibetan is of special importance for 
this study because of its dominant position in the Amdo Sprachbund (Sandman & Simon 
2016, §3.5). 

Whether Qiangic is a valid subgroup of Trans-Himalayan, and which languages it 
should cover, is an ongoing debate. Chirkova (2012) argues that it should be reconceptu- 
alized as an areal rather than a genetic group of Trans-Himalayan languages. Without 
a final solution to the problem at hand, this study retains the common designation as 
Qiangic, which is first and foremost a pragmatic decision. In NEA only one language is 
usually classified as Qiangic: 


Tangut (also known as the Xixia language) is an extinct Tibeto-Burman language 
that was spoken in the Xixia empire that existed from 1038 to 1227 in northwestern 
China. The language was buried in oblivion till 1908 when the Russian geographer 
P.K. Kozlov discovered the ruins of a Tangut city at Khara Khoto. (Gong Hwang- 
Cherng 2003: 602) 
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Baima, tentatively classified as Tibetic here, is sometimes also treated as a Qiangic 
language (Chirkova 2012: 139). 

There have been many attempts to connect Trans-Himalayan with other language 
families, none of which is widely accepted. A Sino-Tibetan-Austronesian hypothesis that 
also includes Tai-Kadai as a branch of Austronesian is currently being debated (see Sagart 
2016), but does not seem to be gaining acceptance. 


2.10 Tungusic 


Tungusic is the name of a language family that includes about a dozen to twenty different 
languages distributed over a vast area in Siberia and Northern China. Experts do not 
agree on the exact number of languages, primarily because of the complex network of 
dialects and mutual influence. Instead of Tungusic, some researchers prefer the name 
Manchu-Tungusic (e.g., Pevnov 2012), but I will continue to use the name Tungusic as a 
convenient label for the whole language family. The name Tungusic historically referred 
to the Evenki or the Even and their languages, but today does not designate any specific 
variety. In addition, if understood in the old sense, the name Manchu-Tungusic actually 
refers to only two or three of many more languages. In addition, the term suggests a 
primary split of the language family into Manchu and Tungusic, which is not necessarily 
accurate (e.g., Ikegami 1974; Georg 2004; Janhunen 2012d; Hölzl 2015a; 2017a). What is 
more, the name Tungusic belongs to a long tradition of referring to the whole language 
family (e.g., Benzing 1956). 

Tungusic today is usually classified into four different groups (Ikegami 1974; Georg 
2004), which can be called Jurchenic, Nanaic, Udegheic, and Ewenic (Janhunen 20124). 
According to one hypothesis that will be followed here, the first two form the southern 
Tungusic branch, and the latter two the northern branch. Janhunen (1996; 2005; 2012d) 
assumes that Proto-Tungusic was spoken in southern Manchuria, east of the Liao river 
and partly in the north of Korea: 


The linguistic facts suggest that the Tungusic family represents a classic case of 
language spread from a relatively compact homeland. Against the overall ethno- 
historical picture of Northeast Asia, it appears likely that the Tungusic homeland 
was located in the region comprising Southern Manchuria and Northern Korea, 
the historical habitat of the Jurchen-Manchu. From here Tungusic expansion took 
Tungusic to the Armur basin, where Nanai, Udeghe, and Ewenki branches sub- 
sequently emerged. These initial expansions of Tungusic may have taken place 
between 2000 and 1000 years ago (Janhunen 1996: 216-233). (Janhunen 2005: 39) 


But a more plausible location appears to have been further north, as has also been 
claimed by Pevnov (2012) and Vovin (2013b). An educated guess for an original location 
of Tungusic should probably pinpoint the confluence of the Amur, the Sunggari, and 
the Ussuri. From this region Jurchenic expanded southwards along the Sunggari and the 
Ussuri, Nanaic followed the lower Amur northwards, Udegheic spread along the eastern 
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tributaries of the lower Amur and the Ussuri, and Ewenic speakers migrated along the 
Amur river towards the northeast and to some extent followed the left tributaries such as 
the Bureya and the Zeya. Parts of Ewenic (mostly Evenki and Even) then rapidly covered 
almost all of Siberia. This expansion of Ewenic has also been recognized by Janhunen 
(2005: 39): 


The modern distribution of Tungusic is largely the result of the secondary expan- 
sion of the Ewenki branch, which very probably began from the Middle Armur 
region no more than 1000 years ago. This expansion spread Tungusic over the 
whole of Siberia, from the Okhotsk Sea in the east to the Yenisei basin in the west, 
and from Lake Baikal in the south to the Arctic Ocean in the north. The expan- 
sion has continued until recent times, especially in Northeast Siberia. Territories 
reached only in the 19th century include Kamchatka (Ewen) and Sakhalin (Siberian 
Ewenki). 


Janhunen (2005) is right in pointing out the internal homogeneity of both Evenki and 
Even, which indicates a very recent spread. Even today the number of Ewenic languages 
is highest in Manchuria. 

A recent study found evidence that the direct ancestors of some Tungusic-speaking 
peoples have been living in Manchuria for at least 8000 years: 


We report genome-wide data from two hunter-gatherers from Devil’s Gate, an 
early Neolithic cave site (dated to ~7.7 thousand years ago) located in East Asia, 
on the border between Russia and Korea. Both of these individuals are genetically 
most similar to geographically close modern populations from the Amur Basin, all 
speaking Tungusic languages, and, in particular, to the Ulchi. (Siska et al. 2017: 1) 


This is no proof, of course, that the ancestors of the Tungusic language family were spo- 
ken in the area as well. However, the genetic continuity might suggest that there may 
not have been a language shift from some unknown languages to the Tungusic languages 
family (or its predecessor), which would be expected to leave clearer traces of genetic 
admixture. Another recent genetic study, for example, found that the Udihe appear to 
be “the result of admixture between local Amur-Ussuri populations and Tungusic popu- 
lations from the north? (Duggan et al. 2013: 1) Unfortunately, it is still too early to draw 
any substantial linguistic conclusions based on these results. 

There are too many instances of language contact of Tungusic languages all over NEA 
to be summarized here in detail. Manchu used to be an important superstrate language 
for all languages in Manchuria and also had a certain impact on Mandarin (e.g., Tsuma- 
gari 1997). Manchu itself has a pronounced Mongolic, Para-Mongolic, and perhaps Kore- 
anic adstrate. Sibe had contact with Mongolic languages such as Khorchin and, the group 
of speakers who were relocated to Xinjiang in 1764, with several Turkic languages such 
as Uyghur. Several Tungusic languages had contact with Amuric languages along the 
lower Amur. Evenki had contact with several Mongolic languages such as Buryat, with 
Nivkh on Sakhalin, as well as with Yakut, Yeniseic, Yukaghiric, and Samoyedic. Even had 
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contact with Chukotko-Kamchatkan as well as Yakut, and partly replaced Yukaghiric. 
Orogen and especially Solon had an almost symbiotic relation to the Mongolic language 
Dagur (e.g., Janhunen 1997). The same is true for the two Khamnigan Evenki dialects with 
Khamnigan Mongol (e.g., Janhunen 1991; Janhunen 2003b). Before the advent of Russian 
and Mandarin influence, Khitano-Mongolic exerted the most important influence over 
all of Tungusic (Doerfer 1985). 


2.11 Turkic 


Turkic languages are widespread today, from the Arctic Sea in the north to Qinghai in 
the south and from Manchuria in the east to Turkey in the west (excluding recent mi- 
grations to Germany, for instance). The spread of Turkic all over Eurasia had its begin- 
nings in southern Siberia and northern Mongolia, where the oldest Turkic records, the 
Orkhon inscriptions, were found (Golden 1998). Turkic has perhaps six main branches, 
Oghur, Khalaj, Siberian, Uyghur-Karlak, Kipchak, and Oghuz (Johanson 1998: 81f.; Johan- 
son 2006a: 161f.). First Oghur, today only represented by Chuvash in European Russia, 
and then perhaps Khalaj (in Iran) split away from the rest. Most languages covered here 
are from the Siberian branch, but languages from all branches except Oghur and Khalaj 
are today located in NEA. This study excludes the by now perhaps extinct archaic Turkic 
language Khotong from Mongolia, for which no data are available to me (Shimunek et al. 
2015: 148). 

The classification above only includes modern Turkic languages, but there are his- 
torically attested varieties of Turkic that will be briefly mentioned as well, notably Old 
Turkic and Chagatay. 


Old Turkic is taken to be the language underlying three corpora. The first one con- 
sists of official or private inscriptions in the runiform script, dating from the sev- 
enth to tenth centuries, in the territory of the second Türk empire and the Uyghur 
steppe empire - preset-day Mongolia - and the Yenisey basin. The second and most 
extensive corpus consists of ninth to thirteenth century Old Uyghur manuscripts 
from northwest China in Uyghur, runiform and other scripts. [...] The third cor- 
pus consists of eleventh-century texts from the Karakhanid state, mostly in Arabic 
script [...]. (Erdal 1998: 138) 


Chaghatay can be defined as a succession of stages of written Turkic in Central 
Asia. In many respects it is also a continuation of earlier stages, notably of Karakha- 
nid Turkic, with Kharezmian Turkic as a transitional stage. It cannot be defined 
as a fixed entity in time and space. Chaghaty sources are a hybrid collection of 
different varieties of Turkic, who from the late fifteenth century onwards more or 
less tried to focus on a specific model known as Classical Chaghatay. (Boeschoten 
& Vandamme 1998: 166) 


Chagathay influenced several written languages, including the Kipchak languages Tatar 
and Kazakh, the Oghuz language Turkmen, and the Uygur-Karluk languages Uzbek and 
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Uyghur (Boeschoten & Vandamme 1998: 168). In fact, the Uygur-Karluk branch is some- 
times also called the Chagatay branch of Turkic. 

The extensive contact between Turkic and other languages has been summarized by 
Schónig (2003) and Johanson (2010). Turkic languages in general had strong contact with 
Mongolic. But individual languages underwent a plethora of contact situations that can- 
not all be summarized here. Yakut had contact with Buryat, and later with Evenki, Yuk- 
aghir, and Nganasan, which led to the emergence of Dolgan. In the southwest there is 
contact with Iranian and in the southeast with Sinitic. In the Amdo region there is a 
strong interaction with Mongolic and Sinitic varieties as well as with Amdo Tibetan. 


2.12 Uralic 


Uralic (e.g., Sinor 1988; Abondolo 1998) is a language family with a very long history 
comparable to that of Indo-European. The primary split of the language family separates 
the Samoyedic languages from Finno-Ugric. Despite the rather small comparative corpus 
between Finno-Ugric and Samoyedic, their genetic relation is usually recognized. Proto- 
Samoyedic perhaps split about 2000 years ago, while Finno-Ugric and Samoyedic had a 
common origin in Proto-Uralic about 5000 years ago (Janhunen 2009: 68). The location 
of the Uralic homeland is disputed, but Janhunen (2009: 71) argues for "the borderline 
between the Ob and Yenisei drainage areas in Siberia" and thus for a region at the edge of 
NEA. Given the connection of Uralic with Yukaghiric (82.14), Pre-Proto-Uralic could even 
have been spoken in NEA. However, only the Samoyedic branch is clearly represented 
in NEA (e.g., Janhunen 1998). 


Listed roughly from north to south, these are (older designations given in parenthe- 
ses): Nganasan (Tavgy), Enets (Yenisei-Samoyed), Nenets (Yurak), Selkup (Ostyak- 
Samoyed), Kamass(ian), and Mator (Motor). The southernmost languages Kamass 
and Mator, are now no longer spoken: Mator was replaced by Turkic idioms during 
the first half of the nineteenth century, and the fact that it is known at all today 
is because of intensive philological work done with word lists; the last Kamass 
speaker died in 1989. [...] only Nenets is spoken by a relatively large number of 
people (some 27,000); Selkup, which has sharp dialectal divisions, has fewer than 
2,000 speakers; Nganasan, some 600; and Enets, perhaps 100. (Abondolo 1998: 2) 


Elena Skribnik (p.c. 2017) informed me that in NEA there are also a few speakers of, 
for example, Estonian. However, such isolated groups will mostly be neglected in this 
study (but see Miestamo 2011 and §5.12.2). Together with Yeniseic, Samoyedic forms the 
western border of the NEA area. Samoyedic may have been spoken by the Tagar culture 
(ca. 1000-200 BCE) in the Minusinsk basin (Janhunen 2009: 72; Parpola 2012: 294), but this 
remains somewhat speculative. Just like Yeniseic, Samoyedic spread along the Yenisei 
northwards, while those varieties left behind were slowly replaced by languages from 
other families. 
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Samoyedic had contact with several Finno-Ugric, Yeniseic, and Turkic languages as 
well as Evenki, Russian, and perhaps some early form of Tocharian. Selkup had an espe- 
cially strong interaction with the Yeniseic language Ket. 


2.13 Yeniseic (Yeniseian) 


Typologically, Yeniseic is the most atypical Siberian language family (§3.5). Today it is 
represented by only one language, namely Ket. But there used to be several other Yeni- 
seic languages (Arin, Assan, Kott, Pumpokol, Yugh) that have since disappeared. Yeniseic 
substrate toponyms, largely river names that have endings such as -ul, -ses, or -det, cover 
a large area from the Irtysh in the west to northern Mongolia in the east and indicate a 
more widespread distribution in the past (Vajda 2009a: 474). The homeland of Yeniseic 
may have been the Altai region, especially the Karasuk culture (1200-700 BCE) (Flegontov 
et al. 2016: 1f.). According to Vajda (2010: 33), less than 100 Ket are still able to speak the 
language. 

There is some evidence to suggest that a Yeniseic language was one of the language of 
the historic Xiongnu (4) in northern China, the main rivals of the Han dynasty (206 
BCE to 220 CE) (cf. Vovin et al. 2016 and references therein). In addition, Vajda (2010) 
has made a strong argument for a genetic connection between Yeniseic and Na-Dene 
languages, called the Dene-Yeniseian hypothesis. Apart from Eskaleut, this would be the 
first language family discovered that connects languages in Asia and the Americas. The 
theory is currently gaining acceptance as new pieces are added to the puzzle (e.g., Vajda 
2013), and, at least for the moment, it seems that there are fewer critics than proponents. 
Nevertheless, more research over the following years will show whether the hypothesis 
can stand the test of time. If Dene-Yeniseian turns out to be a valid genetic unit, there are 
several different possible explanations for its modern distribution. One possibility would 
be to assume a location of the proto-language somewhere in (south)eastern NEA. From 
there, Yeniseic moved westwards, whereas Na-Dene moved northwards to finally cross 
Beringia. But Sicoli & Holton (2014) have recently argued for an alternative that assumes 
an original location in the Beringian area. Yeniseic, according to them, is the result of a 
back-migration into Asia. However, this goes against the general rule of thumb that mi- 
grations in NEA usually follow a south-to-north direction. In any case, the migration of 
Yeniseic down the Yenisei and of Na-Dene from Alaska southwards are widely accepted 
and must be common ground for any additional hypothesis. The question of the time 
depth of the hypothetical Proto-Dene-Yeniseian language remains unsettled for now, 
but must necessarily be many thousand years older than Proto-Yeniseic (see §5.13.4). 


2.14 Yukaghiric 


The term Yukaghiric is employed here to refer to the language family usually called Yuk- 
aghir. However, there are two rather different extant Yukaghiric languages, which is 
why a specialized designation for the language family seems in order to avoid confusion. 


34 


2.4 Yukaghiric 


These two languages are called Kolyma Yukaghir (Odul) and Tundra Yukaghir (Wadul). 
Tsumagari et al. (2007) classify Tundra Yukaghir as “seriously endangered” and Kolyma 
Yukaghir as “moribund” as there are only several dozen elderly speakers left for both lan- 
guages (Matić 2014: 130). Yukaghiric languages were mostly replaced by Even (Tungusic), 
Yakut (Turkic), Chukchi, and Kerek (Chukotko-Kamchatkan), as well as Russian (Slavic). 
War with and exploitation by the Russians, together with smallpox epidemics, decimated 
their number drastically. Where there were an estimated 4500-5000 Yukaghir in the 17" 
century, only 150-200 remained at the end of the 19'" century, but their number has been 
growing again ever since (Rédei 1999: 3; Forsyth 1992: 74-80). 

Yukaghiric languages must have been extremely widespread in northeastern Siberia 
until the 17^ century. According to Volodko et al. (2008), the Yukaghir were even in- 
volved in the formation of the Samoyedic-speaking Nganasan much further to the west. 
Even so, they seem to have reached the northern parts of NEA from a location further 
south. Hákkinen (2012: 93) argues that 


Yukaghir[ic] can be derived from the west, as it was spoken earlier near the Lena. 
We may assume that Yukaghir[ic] at some point in the past migrated down the 
Lena, just as Yakut did later, and that Early Proto-Yukaghir[ic] was spoken some- 
where near the Upper Lena and the region of Lake Baikal, the watershed area be- 
tween the Lena and Yenisei river systems. (my square brackets) 


If Hàkkinen's assumption is correct, this brings Yukaghiric geographically much closer 
to other language families such as Tungusic, Samoyedic, Khitano-Mongolic, Turkic, and 
Yeniseic. A southern origin of the Yukaghir is also corroborated by evidence from mito- 
chondrial DNA analyses (Volodko et al. 2008). Hakkinen’s conclusions are built on an 
assumption of a direct contact of Yukaghir with Uralic languages. Janhunen (2009: 61) 
explicitly denies a connection between Uralic and Yukaghiric. But most researchers do 
not exclude the possibility of a genetic connection (e.g., Pispane 2013) or at least con- 
tact (e.g., Rédei 1999; Aikio 2014). The separation of the two Yukaghiric languages has 
been estimated to date back to about 2,000 years ago (Maslova 2003a: 28), which remains 
rather speculative and might be an overestimation. For instance, personal pronouns in 
the two extant Yukaghiric varieties are basically identical, which would not be expected 
after such a long period of separation. The location of Pre-Proto-Yukaghiric in the south 
of NEA, on the other hand, must be much older and has been tentatively dated to the 
early-middle Holocene (Volodko et al. 2008: 1097) and thus might be much earlier than 
"Early Proto-Yukaghir" as was assumed by Hákkinen (2012: 93). 
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3 Areal typology and Northeast Asia 


Chapter 2 has introduced the languages of Northeast Asia from a genetic perspective, 
ie. classified into language families. The focus in the present chapter is on language 
contact instead and adds an areal perspective to the discussion. The two classifications 
are not always clearly separable, especially at greater time depths (e.g., Nichols 2010; 
Operstein 2015), and given the fact that, naturally, languages from one family can also 
have contact with each other (e.g., Epps et al. 2013). Since an exhaustive presentation 
of all language contact phenomena goes well beyond the possibilities of this study— 
“Language contact is everywhere” (Thomason 2001: 8)—, there will be a focus on some 
points that are especially relevant. 


3.1 Theoretical considerations 


This chapter is concerned with structural diversity, or rather, structural similarity among 
languages. There are several different reasons that languages can be similar, including 
universals, tendencies, chance, genetic inheritance, and language contact (e.g., Aikhen- 
vald & Dixon 2001: 1-3). It seems that all languages around the globe have specialized 
constructions for asking questions, so that this is a linguistic universal and the reason 
this study is possible in the first place. Interestingly, there might even be universal ques- 
tions such as ‘What is your name?’, "Who are you?’, and ‘What is that?’ that are, however, 
expressed differently from language to language.! There may be yet more specific univer- 
sals. Dingemanse et al. (2013: 1) have quite convincingly shown that the repair initiator 
huh? could well be a universal word “not because it is innate but because it is shaped by 
selective pressures in an [enchronic] interactional environment that all languages share: 
that of other-initiated repair" In my opinion, potential universals of this kind have to be 
distinguished from strong tendencies, such as the fact that positive one-word answers in 
a great many languages around the globe contain laryngeal sounds [h] and [?] (Parker 
2006). Take German, for instance, which has the word ja ‘yes’. At first glance, this does 
not contain any laryngeal sounds, but it has many different variations, among which 
one encounters [ja?] with a final glottal stop as well as ingressive [hja|] with an initial 
laryngeal fricative (my knowledge). A similar tendency is for languages to have rising 
intonation in polar questions, which is common but by no means universal. Hawai'i Cre- 
ole English, for example, has falling intonation instead (Veluppilai 2012: 353). A factor 
that should not be underestimated is chance resemblance. An example from the category 


!David Gil (p.c. 2018) informs me that he is working on a typology of the question ‘What is your name?’ on 
which see also Idiatov (2007); Hólzl (2014b) and §§4.3.1, 5.6.3. 
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of questions are the polar question markers -(V)? in Hup (Nadahup, Epps 2008: 784ff.) 
in South America and -? in Crow (Siouan, Graczyk 2007: 391) in North America that at 
least in some instances are basically identical. To my knowledge there does not appear 
to be a general tendency for question markers to exhibit laryngeal sounds, as far as we 
know the two languages do not share a common ancestor, and there certainly was no 
contact between them. This only leaves pure coincidence to account for this similarity. 
An example for a chance resemblance in the interrogative system would be Tocharian 
(Indo-European) kos and Dolgan (Turkic) kas ‘how much’. Given that both interroga- 
tives and question markers tend to be very short, chance resemblance is extremely hard 
to distinguish from genetic inheritance and language contact. As seen in Chapter 2, ge- 
netic inheritance refers to languages of one and the same language family that go back 
to one proto-language and therefore preserve features that are similar to each other. The 
two Tungusic languages Evenki and Even, to take a random example, shared a common 
ancestor only several centuries ago and therefore display many similarities such as an 
almost identical question marker =Ku (§5.10.2). The last of the explanations for the sim- 
ilarity between different languages is language contact. The term language contact, of 
course, is nothing but a metaphorical abstraction of what is actually an integral part of 
the complex interaction of different human beings. But certainly it serves its purpose 
to facilitate our discourse on the topic. Language contact presupposes a linguistic inter- 
action of speakers of different languages (Thomason 2001: 1f.). Perhaps every linguistic 
interaction has certain properties that qualify as language contact. However, language 
contact is usually identified through observable results such as the borrowing of ele- 
ments. Contact may be either direct or indirect. The latter can be further divided into 
the contact of two languages with a transmitting language on the one hand, and a com- 
mon contact language of two languages on the other. 

The outcome of language contact differs from instance to instance. Thomason (2001: 
10) suggests “a hierarchical set of typologies, starting with a three-way division at the 
top level into contact-induced language change, extreme language mixture (resulting in 
pidgins, creoles, and bilingual mixed languages), and language death.” In NEA there are 
examples of all three kinds, but the details of Thomason’s (2001: 60) typology are too 
complex to be repeated here in full. Language shift, which today is an extremely com- 
mon phenomenon around the globe and in NEA, will for the most part be excluded for 
the lack of relevant data concerning the effects on the grammars of questions. Exam- 
ples for extreme cases of language contact found in NEA include two extinct pidgins 
(Chinese Pidgin Russian, Govorka, §5.5), some creolized languages (e.g., Gangou, Wu- 
tun, Tangwang, Hezhou, §5.9), several mixed languages (Eynu, §5.11, Mednyj Aleut, §5.4, 
some Tungusic languages §5.10, §6.3, and an Ainu-Itelmen hybrid), and perhaps some 
slightly less extreme cases such as Mandarin or Manchu (e.g., McWhorter 2007). 

Contact-induced change has several subtypes (relabeling, calquing etc.), but arguably, 
regarding the grammar of questions, the most important case is borrowing, simply put 
the transfer of a certain element from one language to another. But how do we actually 
know that a linguistic item in a given language can be explained by language contact 
rather than genetic inheritance? Let me illustrate this with an example from the Tungu- 
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sic language Uilta (§5.10.2). Uilta has a content question marker =ga ~ =ka. A comparison 
with closely related languages such as Nanai shows that the marker is not present, in 
fact, content questions in Nanai remain unmarked. The fact that related languages do 
not show this marker in most cases rules out an explanation in terms of genetic inher- 
itance. Then the form might simply be an innovation found in Uilta, but no plausible 
etymology is known to me. Thus, Uilta perhaps borrowed the question marker from a 
surrounding language. Uilta is spoken on Sakhalin where it is known to have had con- 
tact with the neighboring language Nivkh (Yamada 2010, §5.2.2). In fact, Nivkh has overt 
content question markers, one of which has the form =na. Of course, Uilta could also 
have borrowed the question marker from other surrounding language families such as 
Japonic. Tsuken, for example, has a content question marker =ga. However, Tsuken is 
spoken in the Ryükyüan Islands several thousand miles south of Sakhalin. This geograph- 
ical distance makes a connection extremely implausible, because the speakers of Uilta 
and Tsuken quite certainly had no direct contact with each other. But what about Japa- 
nese, which was once spoken on Sakhalin and has a question marker ka 7^ that can also 
be found in content questions? First of all, Uilta had much more longstanding and inti- 
mate contact with Nivkh than with Japanese. However, in order to refute this possibility, 
more information on Japanese and Uilta is in order. Old Japanese already possessed the 
question marker, which had more or less the same form, but in Uilta there are further 
forms such as =ge (alternatively written with a schwa a and an optional long vowel). 
Given that Uilta has vowel harmony in which a stands opposed to e (Tsumagari 2009b: 
3), this appears to be an innovation and the form might still derive from either Nivkh 
or Japanese. However, the integration of the question marker into the morphological 
system suggests a relatively early borrowing, which makes a comparison with Japanese 
much less likely. Furthermore, Nakanome (1928: 50ff.) mentions a form that was written 
as «nó». The pronunciation of this form must be [na], as a comparison of Nakanome 
(1928) with Ikegami's (1997) modern dictionary suggests, e.g. «ónnó» = [anna] ‘mother’, 
<nala> = [naala] ‘hand’. The existence of the velar nasal makes a comparison with Nivkh 
much more likely than with Japanese. The fact that both Nivkh and Uilta, but not the 
surrounding languages, overtly mark polar and content questions differently—i.e. there 
is a similarity in type—confirms this hypothesis (e.g., Hólzl 2015e). This typological par- 
allel has also recently been observed by Pevnov (2016: 59f.). On the contrary, Japanese 
allows the marker ka in both polar and content questions. For reasons of space, this pro- 
cedure will not be given in full detail for every potential instance of borrowing identified 
in Chapter 5. A list of all the borrowed elements of the grammars of questions in NEA 
found throughout this study is given in Chapter 6. In several cases the details will have 
to be discussed by experts of the individual languages. 

One of the central concepts of areal linguistics is the heavily disputed notion of a 
linguistic area or sprachbund. The best summary of previous approaches can be found in 
Campbell (2006: 18), whose rather skeptical conclusion is the following. 


Every ‘linguistic area’, to the extent that the notion has any meaning at all, arises 
from an accumulation of individual cases of ‘localized diffusion’; it is the investi- 
gation of these specific instances of diffusion, and not the pursuit of defining prop- 
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erties for linguistic areas, that will increase our understanding and will explain 
historical facts. 


There is a strange dissonance between theoretical approaches that usually take a negative 
stance on the concept (e.g., Dahl 2001; Bisang 2010) and the widespread use of the term 
for individual areas such as the Amdo Sprachbund. This study acknowledges the funda- 
mental theoretical problems of the concept, but takes a pragmatic approach. The term 
linguistic area is taken as a useful label if it is not meant to indicate clear-cut boundaries 
or absolute homogeneity. Like many linguistic phenomena, linguistic convergence is ob- 
viously a matter of degree (cf. Langacker 2008: 13) and there is no problem in calling areas 
of strong convergence a sprachbund or linguistic area. As a rule of thumb, an area should 
be characterized with the help of features that are not very common cross-linguistically, 
and that are not shared with surrounding areas. NEA is surrounded by several possible 
areas such as the Greater Himalayan Region to the south (Kraaijenbrink et al. 2009) and 
the Pamir-Hindukush Sprachbund (Novak 2014: 82) as well as the Araxes-Iran Linguistic 
Area to the southwest (Stilo 2015), and Mainland Southeast Asia (Enfield & Comrie 2015) 
to the southeast. Unfortunately, with only the exception of MSEA (Enfield & Comrie 
2015), the definition of all of these areas is quite problematic. Nevertheless, the fact that 
the entire southern and southeastern boundary is marked by mountains teeming with 
linguistic diversity indicates that they form an accretion or residual zone (Nichols 1992; 
1997; 2015, see §3.4) that functions as some kind of boundary. The most difficult problem 
is the identification of a western boundary (Heggarty & Renfrew 2014a: 873). Immedi- 
ately to the west of NEA live the speakers of the Uralic, more precisely Finno-Ugric, 
languages Khanty and Mansi that are sometimes collectively called Ob-Ugric. Their ge- 
netic classification is disputed, with some arguing that they belong to a single branch and 
others for a classification into two different branches that had strong mutual contacts, 
called Khantic and Mansic by Janhunen (2009: 65). It is difficult to consider these two 
languages as forming a useful western boundary. But the Western Siberian Lowland to- 
gether with Kazakhstan to its south is a region of low linguistic diversity (a spread zone, 
Nichols 1992), which contrasts with the adjacent areas of NEA along the Yenisei. Located 
to the west of the Ural mountains, and thus separated from NEA by the Western Siberian 
Lowland, lies the Volga-Kama Area (see Manzelli 2015). This is an area of strong linguis- 
tic convergence between several Finno-Ugric and Turkic languages (see §5.11, §5.12). If 
one was to extend NEA to include all of the area to the east of the Ural mountains into, 
say Northern Asia (Nichols 1992: 25f.), the Volga-Kama area would certainly function 
as a better western boundary than does Ob-Ugric. Nevertheless, several languages with 
affinities to NEA, notably Finnish or Turkish would still be located to the west of the 
Volga-Kama Area. 

For practical purposes, Eurasia will be treated as a macro-area (§3.2) that contrasts 
relatively sharply with Mainland Southeast Asia (§3.3) and contains a meso-area called 
Northeast Asia (§3.4). NEA in turn encompasses several possible micro-areas such as the 
so-called Amdo Sprachbund (§3.5) only some of which will be mentioned in this chapter. 
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3.2 The Eurasian macro-area 


As is well-known, NEA is part of a large Eurasian area that is characterized by several 
dominant features (Table 3.1). This area includes most of Eurasia but not Mainland South- 
east Asia (MSEA), parts of Europe and parts of the Near East. There is some variation in 
the geographical distribution of these features, but notably NEA invariably shares all of 
them. The variation concerns the periphery of the Eurasian area such as Europe. 

The usefulness of some of these word order features for the identification of linguistic 
convergence is somewhat reduced by the existence of implicational hierarchies connect- 
ing several of them (e.g., Bisang 2010: 422; Dryer 2013a). Nevertheless, they define a rela- 
tively clear-cut boundary towards the southeast. A possible further trait of this Eurasian 
Area is the existence of K-interrogatives (§6.2.1). 


3.3 Mainland Southeast Asia 


The sharpest contrast of NEA with other areas is that with Mainland Southeast Asia 
(MSEA), the adjacent region to the southeast, which has recently been defined as 


the area occupied by present day Cambodia, Laos, Peninsular Malaysia, Thailand, 
Myanmar, and Vietnam, along with areas of China south of the Yangtze River. Also 
sometimes included are the seven states of Northeast India, and—although here 
the term ‘mainland’ no longer applies—the islands from Indonesia and Malaysia 
running southeast to Australia and West Papua (Enfield & Comrie 2015: 1) 


MSEA is widely accepted as a region of strong convergence of five different language 
families, namely Trans-Himalayan (Sino-Tibetan), Tai-Kadai, Hmong-Mien (Miao-Yao), 
Austroasiatic, and Austronesian. My definition of NEA excludes the Yangtze watershed, 
a part of which is likely the historical homeland of the Hmong-Mien languages (Ratliff 
2010: 241) that clearly belong to the MSEA area. Moreover, Sinitic languages show an in- 
ternal split between northern and southern varieties (e.g., Ramsey 1987: 19-26; Matthews 
2010: 760f.). In a certain sense, the distinction between Mandarin and Southern Sinitic 
varieties is symptomatic for the difference between NEA and MSEA. Mandarin is rather 
homogeneous and is spread over a vast area ranging from Yunnan in the Southwest 
to Heilongjiang in the Northeast and from Jiangsu in the east to Xinjiang in the west. 
Southern Sinitic, on the other hand, is limited to a much smaller geographical area but 
nevertheless shows extremely strong internal variation with many mutually unintelligi- 
ble varieties (Kurpaska 2010). 


There is a qualitative difference between these two areas. The Mandarin area, on 
the one hand, is unusually uniform; virtually all of the dialects spoken there are mu- 
tually intelligible—or very nearly so. [...] But the non-Mandarin area is extremely 
varied, and within it sharply divergent forms of speech are often separated by only 
a few miles. (Ramsey 1987: 21) 
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Table 3.1: Some areally dominant features of NEA shared with neighboring 


areas 


Feature 


Approximate area 


Source 


SOV (SV & OV) 


noun-adposition 
(NAdp) 


genitive-noun 
(GenN) 


adjective-noun 
(AdjN) 


demonstrative-noun 
(DemN) 


numeral-noun 
(NumN) 


relative clause-noun 
(RelN) 


degree word-adjective 
(DAdj) 


non-initial interrogative 


morphological case 
marking 


suffixing in inflectional 
morphology 


m-T-pronouns present 


Eurasia, excluding MSEA and Europe 


Eurasia, excluding MSEA, most of 
Europe, and parts of the Near East 


Eurasia, excluding southern MSEA, 
western Europe, and the Near East 


Eurasia, excluding MSEA, western 
Europe, and the Near East 


Eurasia, excluding MSEA and 
northwestern Europe 


Eurasia, excluding parts of MSEA 


Eurasia, excluding parts of MSEA, parts 
of South Asia, Europe, and the Near 
East 


Eurasia, excluding parts of MSEA 


Eurasia, excluding Europe 


Eurasia, excluding western Europe and 
MSEA 


Eurasia, excluding MSEA 


large parts of Eurasia, excluding MSEA 
and its adjacent areas to the northeast 


Dryer 2013i 


Dryer 2013c 


Dryer 2013f 


Dryer 2013b 


Dryer 2013e 


Dryer 2013g 


Dryer 2013h 


Dryer 2013d 


Dryer 2013k 


Iggesen 2013 


Dryer 2013m 


Nichols & Pe- 
terson 2013 


and west 


42 


3.4 Northeast Asia 


But Mandarin also differs from most of Sinitic in structure. Southern Sinitic exhibits 
stronger affinities to Southeast Asian languages than does Mandarin, which has been 
more strongly influenced by languages in NEA. There is a debate as to whether the spe- 
cial structure of Mandarin can be explained by “Altaicization”, i.e. influence from Turkic, 
Khitano-Mongolic, and Tungusic (Hashimoto 1986), or reduction due to non-native ac- 
quisition of speakers of languages in nowadays northern China (McWhorter 2007: 104- 
137). But in any case, this can be labeled an areal feature that separates Mandarin from 
the rest of Sinitic. Following an extensive discussion, de Sousa (2015: 429) concludes the 
following: 


Some studies on the MSEA linguistic area leave out the languages in China. This is 
unwise, as the centres of diversity for the Kra-Dai and Hmong-Mien families are 
still in Southern China, and the Southern Sinitic languages also have many MSEA 
linguistic traits. Studies of the MSEA linguistic area would benefit immensely if 
the Southern Sinitic languages, the Far-Southern Sinitic languages in particular, 
are included in the MSEA linguistic area. 


Within the human genome, too, there is a marked difference between Northern and 
Southern Han populations, the dividing line of which roughly coincides with the Yangtze 
river (e.g., Zhao Yong-Bin et al. 2014 and references therein). As is well-known, there is 
also a stereotypical division into North and South as perceived by the Chinese them- 
selves that at least in part has a basis in actual facts such as the predominant cultivation 
of wheat and rice, respectively (e.g., Eberhard 1965: 601f.). My approach thus stands op- 
posed to Heggarty & Renfrew (2014a: 870), who classify the linguistic landscape of East 
Asia around a “Chinese core” into a northern, a Sinitic, and a southern zone. Of course, all 
Sinitic languages share certain inherited properties. Perhaps, Sinitic and especially Man- 
darin may thus be better conceptualized as a transitional zone between MSEA and NEA 
(Dryer 2003: 48ff.; Comrie 2008). However, in stark contrast to Northeast Asia, Mainland 
Southeast Asia (MSEA) generally has the following word order features: SVO (SV & VO), 
AdpN, NGen, NAdj, NDem, NNum, NRel, AdjD (de Sousa 2015: 366). Languages in MSEA 
usually lack inflectional morphology and have no sign of m-T-pronouns. Of the features 
listed in Table 3.1, MSEA only shares the non-initial interrogatives. However, for this 
southeastern neighbor a much longer list of distinguishing linguistic features, such as 
the lack of a voiced [g] or the existence of complex tone systems, has been summarized 
by Enfield & Comrie (2015: 7f.). At least for some of them there is no clear-cut bound- 
ary to neighboring areas. For instance, Mandarin, Manchu, and Japanese share a similar 
syllable structure with only very few possible final consonants. In Manchu the only ex- 
ceptions are ideophones, which is yet another feature that is not unique to MSEA but 
shared with many languages in NEA as well. 


3.4 Northeast Asia 


In terms of language diversity and phylogenetic diversity, MSEA and NEA show strik- 
ingly different patterns as well (Table 3.2). Southeast Asia is home to only five language 
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families, but in its broadest definition encompasses almost 600 languages. During the 
preparation of this study it became increasingly clear that an exact number of languages 
cannot possibly be given for NEA. There is a constant fluctuation of languages spoken 
by tourists, exchange students, foreign workers, etc. But even if one leaves aside this 
problem, it is by no means clear at what point a dialect should be counted a language 
or at what point a language should be considered extinct. For instance, the northern 
Tungusic languages Evenki, Even, Negidal, Oroqen, and Solon, each of which has strong 
internal dialectal variation, as well as the extinct language Arman form a complex net of 
dialect continua. If one agrees with the traditional point of view and considers Arman 
a dialect of Even, the language as such never went extinct (cf. Doerfer & Knüppel 2013). 
Evenki alone has about 50 different dialects and experts disagree on whether Oroqen 
dialects should be included in the list or not (e.g., Whaley & Li 2000; Janhunen 2012d: 
7). Given the rapid shift of speakers of both Evenki and Oroqen to Russian and Chinese, 
respectively, it is often only the older generation that can speak the languages. In some 
cases no fluent speaker is left, but some relics of the language nevertheless remain in the 
form of individual expressions or passive speakers. Clear-cut distinctions in these cases 
are neither feasible nor desirable (cf. Langacker 2008: 13). Leaving aside this fluctuation, 
most of the dialects, and clearly extinct languages, NEA may be estimated to be home 
to between 120 and 150 languages. However, NEA shows much more diversity in the 
number of language families than does MSEA. 


Table 3.2: Comparison of language and phylogenetic diversity in MSEA (En- 
field & Comrie 2015: 6) and NEA, excluding historically attested languages (this 


study) 
Core MSEA Greater MSEA NEA 
language families 5 5 14 
languages 280 583 ca. 120-150 


Of course, (Greater) Mainland Southeast Asia actually encompasses more than five 
language families if one includes all small language families (or “isolates”) such as (ex- 
tinct) Kenaboi, Shom Peng (perhaps Austroasiatic), (extinct) Great Andamanese, or On- 
gan (Jarawa-Onge) (e.g., Hammarstróm et al. 2016). The phylogenetic diversity of NEA 
is also much higher than that of the entire landmass to the west. Excluding extinct lan- 
guages such as Etruscan and the relatively recent migrations from other parts of the 
world, there are only representatives of five language families in Europe today, namely 
Indo-European, Uralic, Basque, Afroasiatic (Maltese), and Turkic. The Caucasus alone 
adds three more families, but even so, NEA still exhibits much more phylogenetic diver- 
sity. Anderson (2010: 137) goes so far as to call the eastern part of NEA, where represen- 
tatives of 12 of the 14 language families are spoken, a language hot spot with a ^high level 
of unique phylogenetic linguistic diversity endemic to the region". Of course, if a macro- 
family such as Transeurasian (Robbeets 2015), allegedly including five different language 
families, was to be proven, the phylogenetic diversity of NEA would be lower but still 
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higher than in MSEA, not to mention that there are attempts to lump together language 
families in MSEA as well (e.g., Sagart 2016). But linguistic diversity in NEA and around 
the globe is in retreat as many speakers are shifting to larger languages. Not only the 
number of languages is fading (decrease in language diversity), but whole families such 
as Ainuic, Amuric, Tungusic, Yeniseic, Yukaghiric, and perhaps Chukotko-Kamchatkan 
as well as Samoyedic will probably not survive this or the next century (decrease in 
phylogenetic diversity). Eskaleut, which will persist in other parts of the world, could 
disappear from NEA as well. In other words, NEA could be the home of languages from 
only six families in future times, although globalization will bring many more languages 
from around the world into this area as well. 

A good overview of some areal traits found throughout Northeast Asia and adjacent 
areas has recently been given by Nichols (2010: 366): 


Interior Asia has been a center of language spread at least since the Neolithic. 
The linguistic evidence points to strong and long-term areality in the epicenter of 
spread, with innovations made in the center eventually showing up farther away. 
To judge from its distribution, the m-T pronoun type may have spread early and 
then developed its strong structural parallelism in later innovations in the cen- 
ter; case-number coexponence is found at the far peripheries of the area (besides 
Uralic and Indo-European it also occurs in Chukchi and West Greenlandic), but for 
at least the last few millennia the classic agglutinating type (with monoexponential 
and transparently segmentable suffixes) has predominated in the epicenter. Phone- 
mic front rounded vowels may have spread from the epicenter more recently. The 
consistently head-final morphosyntax of Uralic, core Altaic, Japanese, etc. is more 
generally widespread in Eurasia and not specific to this northeastern area. 


In fact, perhaps one of the strongest features of NEA are the front rounded vowels ii and 6. 
A previous study by Maddieson (2013) has shown that these are, by and large, restricted 
to Eurasia, but it seems that this is a relatively late expansion out of NEA where the 
highest concentration of languages with these vowels can be found (Table 3.3). In many 
cases, the available descriptions are not extremely specific about the exact nature of the 
vowels, i.e. whether they are exactly [y] and [e] or slightly different sounds. 


Table 3.3: Front rounded vowels in Northeast Asia in comparison with Mad- 
dieson's (2013) global sample; see $6.4 and the Appendix for the data 


Type NEA Global 
not present (or unknown) 47 525 
high and mid 24 23 
high only 8 8 
mid only 4 6 
Total 83 562 
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The comparison of the two different samples, global and Northeast Asian, is quite re- 
vealing. While altogether 36 out of 83 languages in NEA have at least one kind of front 
rounded vowel (about 43%), Maddieson (2013) found only 37 out of a sample of 562 lan- 
guages (about 7%). There are almost no languages of this type along the Pacific Rim, i.e. in 
Pacific NEA. In fact, excluding the far Northeast (Eskaleut and Chukotko-Kamchatkan) 
as well as Japan (Japonic and Ainuic) results in an even larger number of 60% (36 out of 
60 languages). In NEA all languages with front rounded vowels are from seven language 
families, namely Koreanic, Khitano-Mongolic, Trans-Himalayan (especially Sinitic), Tun- 
gusic, Turkic, Uralic, and Yukaghiric. They were historically lost in many Mongolic and 
especially Tungusic languages, in the latter case possibly because of contact with lan- 
guages along the Pacific Rim such as Amuric. Maddieson (2013) mentions only a few 
languages outside of NEA with front rounded vowels. Of these, four in the Americas, 
three in the Pacific region and one in Africa are of no concern for us here. But there are 
several languages in Eurasia, more exactly, six to the adjacent south and thirteen to the 
west of NEA that also share the phenomenon. Interestingly, the languages in the west 
include many that have an origin further to the east or within NEA (Hungarian, Finnish, 
Mari, Turkish, Azeri, Bashkir, Chuvash). Table 3.4 summarizes whether front rounded 
vowels can be reconstructed to the fourteen proto-languages of languages that are today 
located in NEA. There will be no comment on the accuracy of the reconstructions and on 
the details of later developments here, which goes beyond the possibilities of this study. 
But it may be noted that Vovin’s (1993) reconstruction of Proto-Ainuic in this case is 


highly doubtful. 


Table 3.4: Reconstructed front rounded vowels (FRV) for languages spoken in 


NEA 
Language High Mid Source Comment 
Proto-Amuric - - Fortescue 2011: 4 
Proto-CK = - Fortescue 2005: 6 
Proto-Eskaleut - - Fortescue 1998: 125 
Proto-IE - - Fortson 2010: 66 
Proto-Japonic - - Pellard 2008: 136 
Old Korean - - Whitman 2012: 28 Korean has both 
Old Chinese - - Baxter & Sagart 2014a:195 Mandarin has high FRV 
Ket - : Georg 2007: 61 
Proto-Ainuic 2+ ?+ Vovin 1993: 42ff. ?verbal stems only 
Proto-Uralic + - Sammallahti 1988: 481 
Proto-Samoyedic + + Janhunen 1998: 463 
Proto-Mongolic + + Janhunen 2003d: 4 later both mostly lost 
Khitan + - Miyake 2017: 493 
Proto-Tungusic + +  Doerfer 1978a; 1978b later both mostly lost 
Proto-Turkic + + Rona-Tas 1998: 70 
Proto-Yukaghiric + + Nikolaeva 2006: 57 high FRV later lost 
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I currently lack exact reconstructions for Koreanic, Trans-Himalayan and Yeniseic, 
which is why Old Korean, Old Chinese and Ket have been listed instead. The presence of 
front rounded vowels in Yukaghiric corroborates the hypothesis that this language fam- 
ily historically derives from a location further to the south (§2.14). Similarly, Uralic likely 
derives from a location close to or perhaps even in NEA (§2.12). Front rounded vowels in 
this part of the world seem to be a “Ural-Altaic” phenomenon (including Yukaghiric but 
excluding Japonic and Koreanic). However, the origin of the similarity does not necessar- 
ily lie in a common origin but may well be the result of prehistoric language contact in 
southern NEA. It seems that the historical center of the phenomenon clustered around 
Lake Baikal. Perhaps, its emergence is connected to the phenomenon of vowel assimi- 
lation, i.e. vowel harmony (e.g., Maddieson 2013: Chapter Text). The history of German 
shows that vowel assimilation (in this case umlaut) can most likely be responsible for the 
emergence of front rounded vowels. The list of proto-languages with front rounded vow- 
els roughly corresponds to the list of proto-languages with KIN-interrogatives. Excep- 
tions include Tungusic (without KIN-interrogative), and Eskaleut (without front rounded 
vowels). The status of both the interrogative (hunna ‘who’) as well as the vowels in Proto- 
Ainuic is questionable. 

A well-known concept of areal linguistics is that of spread versus residual zones (e.g., 
Nichols 1992: 13-24; 1997; 2015; Dahl 2001: 1460f.; Bisang 2010: 431f.). Large parts of NEA, 
especially in the steppes towards the west and along the Lena qualify as spread zones 
(Nichols 1992: 13-24). In fact, the Eurasian steppe was her prime example. Spread zones 
are areas with low phylogenetic diversity, low structural diversity, and also low language 
diversity per language family. There is also no accumulation of diversity over time. In 
spread zones there is rapid expansion of languages over vast areas that subsequently 
serve as lingua francas for and often replace languages previously spoken in that area. 


Each language or dialect group spreading westward on the steppe probably took 
the form of a classic dialect-geographical area, with a center of innovation (in its 
eastern range, at least initially) and archaisms on the periphery. Certainly there 
were centers of political, economic, and cultural influence (Nichols 1992: 16) 


One prime example of language spread is the expansion of Sinitic from around the Yel- 
low River southwards towards MSEA, an event influenced by state building, complex 
social structures, and warfare. Beginning in the 18th century, Mandarin, again starting 
from about the same area, expanded towards the regions around core China, i.e. Manchu- 
ria, Inner Mongolia, Xinjiang, Tibet, Qinghai, and the Southwest. Mandarin is not only 
used as a main language of communication in all of China and is rapidly replacing many 
minority languages, but is currently also influencing or even replacing several Sinitic va- 
rieties in the South that are the result of the earlier spread. The history of the southern 
parts of NEA over thousands of years is strongly based on the emergence and spread 
of multicultural and multilingual confederations ranging from even before the ancient 
Xiongnu (ca. 3rd century BCE to 4th century CE) to the Manchus from the 17th century 
onward. The moving factor behind the spread of languages and language families can 
often be found in cultural or technological innovations, the domestication of different 
plants and animals, etc. In the case of Indo-European (except Anatolian), for instance, 
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this possibly was the use of the wheel and wagon (Anthony & Ringe 2015). NEA has 
seen a variety of spreads of languages or language families over large distances, but to 
my knowledge, in most cases they have not been clearly linked with such innovations 
yet. We do not know, for example, which language group was connected with the origi- 
nal domestication of the reindeer in NEA several thousand years ago, which happened 
independently of the domestication in northern Europe (Reed 2008). But we know that 
the expansion of some northern Tungusic languages, some Samoyedic languages, Yuk- 
aghiric, and Chukchi were likely connected with this innovation (e.g., Janhunen 1996: 
61ff.; Helimski 1998: 480; Anderson 2006a,e). Further to the south, the domestication of 
the horse about 7000 years ago was crucial for the steppe cultures, connected with sev- 
eral language families including Indo-European, Turkic, or Mongolic (Anthony 2007). 
The yak played a comparable role for the high altitude regions in the southern periph- 
ery of NEA around the Tibetan highland, but reaching as far north as the Altai (Wiener 
2013). The domestication of the dog may have a relatively long history as compared 
to that of the other animals mentioned. A recent study found evidence "that sled dogs 
could have been used in Siberia around 15,000 years ago" (Pitulko & Kasparov 2017: 491). 
In NEA dog sleds were used, for instance, by the Nivkh and some surrounding Tungusic 
populations, but also by Samoyeds, Yukaghirs etc. However, the spread of languages is 
not necessarily based on the spread of its speech community by means of growth and 
migration. Another important mechanism of language spread is language shift, i.e. the 
shift of a given speech community from one language to another (e.g., Nichols 1997: 372; 
Janhunen 2007b: 74). Most cases are a combination of these factors. 

Spread zones are opposed to residual or accretion zones (Nichols 1997: 369f.), which 
Nichols (1992: 13-15) illustrated with the help of the Caucasus. These are areas that have 
greater phylogenetic, language, and structural diversity. Language families tend to be 
older (i.e., the age of the respective proto-language lies further in the past) and there 
are fewer movements of peoples and languages than in spread zones. “As in mountain 
areas, innovations arise in the periphery (in the lowlands) and archaisms are found in the 
interior (in the highlands)” (Nichols 1992: 14) Residual zones are areas of retreat rather 
than spread, usually do not show a single lingua franca over the entire area, and have an 
increase of diversity. There are several possible residual zones in, or rather around, NEA, 
including most of Pacific NEA (e.g., Ryükyüan Islands, Hokkaido, Sakhalin, Kuril Islands, 
Kamchatka, Aleut Islands), the lower Amur, and many mountain ranges and high altitude 
regions (e.g., Yunnan, Amdo, the Tibetan Plateau, the Himalayas, the Pamir, the Altai). It 
should be borne in mind that the features of spread and residual zones mentioned above 
do not all apply in every case but represent valid tendencies. 

Both Anderson (2006d) and Pakendorf (2010) grant Northern Tungusic (more precisely 
Ewenic) languages a special position for the Siberian area 


The features ofthe Siberian linguistic macro-area cluster around those ofthe North- 
ern Tungusic languages and this is not by accident. Indeed, the highly mobile 
Evenki (and to a lesser degree its sister language, Even) both have the local bilin- 
gualism relationships and widespread distribution necessary to make them likely 
vectors of diffusion for at least some of these features (Anderson 2006d: 294) 
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Expanding on this proposal, one might argue that it is not only the Ewenic branch, but 
all of Tungusic that used to have a rather special position for NEA. Today, Tungusic lan- 
guages are mostly endangered, moribund or already extinct (Janhunen 2005; Tsumagari 
et al. 2007), but one should not underestimate their historical influence over all of NEA. 
Most Tungusic languages are still located in Manchuria where they had a certain amount 
of impact on Mongolic, Amuric, Ainuic, and Koreanic. While Evenki and Even expanded 
into northern NEA and reached places as far apart as Kamchatka and the Taimyr Penin- 
sula, Jurchen and Manchu played an important role for the southern half. The Jurchen 
established the Jin-dynasty (1115-1234) in northern China and the Manchus had an even 
more pronounced influence during their Qing-dynasty (1636-1912) that at its height not 
only included all of modern China, but also what is now the Russian Primorye region 
as well as Mongolia. Most importantly, Manchu played the role of an ad- and substrate 
language of Mandarin in Peking, which later was the basis of Standard Mandarin. Of 
course, there are other language families such as Khitano-Mongolic or Turkic that had 
an even stronger impact in large parts of NEA. 


3.5 Subareas in Northeast Asia 


The following gives a brief overview of areas of linguistic convergence and contact found 
in NEA. The areas may overlap strongly with each other, which is not indicated in every 
case. The discussion limits itself to those areas that seem to be most important for this 
study. 

The territory of NEA as defined here is covered by six different countries: China, Rus- 
sia, Mongolia, North Korea, South Korea, and Japan. Each of these countries has a na- 
tional language that increasingly influences or even replaces all other languages and 
dialects within that country. In the case of North and South Korea these are almost 
identical. There are, therefore, five superstrate languages that may be seen as defining 
special kinds of linguistic areas. The expansion of the national languages proceeds at the 
expense of languages and dialects alike. But even if a given language is strong enough 
to resist the complete loss (e.g., Amdo Tibetan, Buryat, Tuvan, Uyghur, or Yakut), it is 
usually heavily influenced by the national language. Because the superstrate is the same 
in the entire country, there is thus a general tendency for all languages to become more 
similar to each other. 

Apart from the areas of Mandarin and Russian influence, Siberia is doubtless the 
largest subarea of NEA. There have been many studies on this northern half of NEA 
(e.g., Fortescue 1998; Anderson 2004; 2006d; Skribnik 2004; Vajda 2009b; Comrie 2013), 
but its status as a linguistic area has not been finally clarified. The best summary of areal 
features found throughout Siberia has been given by Anderson (2006d): vowel harmony, 
a high back unrounded vowel [w], four nasals (m, n, fi, n), an initial velar nasal (5-), SOV 
word order, morphologically marked reciprocal and desiderative, converbs, case-marked 
clausal subordination, many cases (especially a prolative), suffixing morphology, a dis- 
tinction of dative and allative (but see Pakendorf 2010: 715). Quite problematically, many 
ofthe features such as SOV word order or suffixing morphology are not specific to Siberia 
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(§3.2). The velar nasal indeed shows a very interesting areal pattern, but fails to define a 
Siberian area as well (Anderson 2013). The Caucasus as well as parts of Europe and South 
Asia tend towards the absence of the velar nasal altogether. An extremely large area cov- 
ering most of central Eurasia has velar nasals but not in initial position. Crucially, not 
only the northern parts of NEA, but also MSEA have the velar nasal in initial position. 
Nevertheless, NEA has a sharp boundary to the Americas where a majority of languages 
lack a velar nasal altogether. Interestingly, Mandarin and Manchu, both located in the 
southern half of NEA, historically lost the initial velar nasal, perhaps due to contact with 
Mongolic or Turkic. Despite the addition of other possible features such as the special 
use of speech act verbs (Matić & Pakendorf 2013), Siberia clearly does not qualify as a 
linguistic area comparable with Mainland Southeast Asia (Comrie 2013). Furthermore, a 
treatment of Siberia without the inclusion of at least parts of Manchuria and Mongolia 
is necessarily incomplete. 

Several scholars, notably Comrie (1981: 261-266), Anderson (2003), and Georg (2008), 
have pointed out the special position of Yeniseic languages in Siberia. Comrie (2003: 8) 
summarized the typological differences as follows. 


Ket alone has phonemic tone, and Ket alone has a consistent gender/class system, 
distinguishing masculine, feminine, and neuter nouns, with assignment to mas- 
culine and feminine genders using semantic features that go well beyond a mere 
male/female distinction. While most neighboring languages have relatively sim- 
ple, agglutinating morphological structure, Ket has a substantially different system, 
making use, for instance, of internal flexion and discontinuous roots; and while the 
neighboring languages are at least primarily suffixing, Ket makes widespread use 
of both suffixes and prefixes. 


However, some languages, notably Middle Mongol, Khitan, Manchu, and perhaps even 
the mysterious language of the Rouran (Vovin 2004), had a limited gender or sexus 
system. Prefixes can, furthermore, also be found in Ainuic, Yukaghiric, or Chukotko- 
Kamchatkan. Tones are also present in Japonic, Koreanic, and Trans-Himalayan. 

Altaic is perhaps one of the most disputed proposals for a language family worldwide. 
Still today, there is complete disagreement over the validity of the family and no conclu- 
sion is in sight. Robbeets (2015) restricts the name Altaic to Turkic, Khitano-Mongolic, 
and Tungusic and uses the term Transeurasian as a cover term for Altaic, Japonic, and 
Koreanic. However, given that Altaic or Transeurasian fails to be accepted by a ma- 
jority of scholars, it must be considered an unproven (but still interesting) hypothesis. 
This does not mean, however, that it is not possible that some of the languages, say, 
Khitano-Mongolic and Tungusic are ultimately related to each other. Janhunen (1996) 
proposed the name Khinganic for the hypothetical language family that unites Tungusic 
and Khitano-Mongolic. It should also be pointed out that there are more possibilities than 
genetic relatedness on the one hand and mutual contact on the other hand (cf. Doerfer 
1985). One of several imaginable scenarios is that at least one of the proto-languages in- 
volved really was a mixed language and thus had no clear-cut affiliation in the first place. 
Any theory would need to explain the fact that Turkic, Tungusic, and Mongolic share 
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similarities in the pronominal system that are both too similar to be due to chance and at 
the same time too similar to be of common origin, especially given the absence of a com- 
mon inherited vocabulary (Janhunen 2013: 221). An explanation that seems to be gaining 
acceptance sees the observable lexical similarities as borrowing, especially from Turkic 
to Mongolic and from Mongolic to Tungusic (e.g., Doerfer 1985; Schónig 2003). Therefore, 
one of the most important tasks still is the identification of layers of loanwords in all five 
language families (e.g., Khabtagaeva 2017; p.c. 2018). A bilateral relation of two of the 
so-called Transeurasian languages, namely Koreanic and Japonic, is being advocated in- 
dependently of their relation to Turkic, Khitano-Mongolic, or Tungusic (Whitman 2012), 
but awaits further discussion. Altaic in some parts even today is rather premature in 
the sense that the internal reconstruction of the individual language families should still 
take priority. For instance, it does not make too much sense to compare Proto-Mongolic 
with Proto-Tungusic before the evidence by Khitan (e.g., Janhunen 20122), other Para- 
Mongolic languages (e.g., Shimunek 2017), the Hüis Tolgoi inscription (e.g., Vovin 2017), 
and less well-known Tungusic languages is taken into account. The necessary first step 
for the Khitano-Mongolic side must be the continuing decipherment of Khitan, followed 
by, if possible, an improved reconstruction of Proto-Khitano-Mongolic. On the Tungusic 
side, evidence from several languages such as Alchuka, Bala, or Kyakala (e.g., Mu Yejun 
1985; 1986; 1987, Hólzl 2017b; 2018b) as well as the dialects of Oroqen (Whaley & Li 2000) 
and Manchu (e.g., Hólzl 2018a) keep being neglected in most studies. 

The term Ural-Altaic was originally a proposal for a language family which has long 
since been abandoned. Janhunen (2007b: 78) revived the term in an areal typological 
sense as 


a complex of several language families covering the entire trans-Eurasian belt from 
Finland and Lapland in northern Europe to Korea and Japan in the Far East. Other 
regions where Ural-Altaic languages are spoken, or have until recently been spo- 
ken, include Pannonia, Anatolia, Western and Eastern Turkestan, Mongolia, Man- 
churia, much of Russia, and most of Siberia. The language families conventionally 
‘classified’ as Ural-Altaic are: Uralic, Turkic, Mongolic, Tungusic, Koreanic, and 
Japonic. 


This is as problematic as the use of Altaic as a typological label (Janhunen 2007b; Jan- 
hunen 20072): "areal typology would study the geographical distribution of such features, 
rather than the characteristics of individual areas? (Dahl 2001: 1456) In fact, several of 
the few features mentioned by Janhunen are not characteristic of "Ural-Altaic" languages 
only, but can be found in the larger Eurasian area (83.2). A label such as “Altaic” or "Ural- 
Altaic” may not only lead to misinterpretations concerning genetic connections, but also 
suggests a certain typological homogeneity, which—as Janhunen is clearly aware—is not 
always the case. In many cases such as the pronominal similarities it is more plausible 
to add Yukaghiric to Ural-Altaic rather than Japonic or Koreanic. 

To my knowledge, Janhunen (1996; 1997) offers the only explicit treatment of lan- 
guages in Manchuria. But whatever the exact delimitation of Manchuria, it certainly 
does not qualify as a linguistic area in any sense because it has not been defined on 
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linguistic grounds at all. There are, however, several areas of strong convergence and 
bilingualism within Manchuria such as one around the Mongolic language Dagur that 
includes the Tungusic languages Solon, Oroqen, Manchu, and Mongolian dialects such 
as Khorchin. Gusev (2015b: 72) argues for a linguistic area “which includes the dialects 
of Negidal, Amur Tungusic, Nivkh and Ainu, and in some respects may be a part of a 
larger area, that could embrace other varieties, such as Evenki and Even, Hokkaido Ainu, 
Japanese and the languages of Kamchatka", but fails to mention any defining features for 
this larger area altogether (but see also Yamada 2010). For interference between different 
Tungusic languages in Manchuria see §5.10.1 and 86.3. 

Usually considered one of the best examples of a linguistic subarea in NEA, the Amdo 
Sprachbund can be found in northwestern China (e.g., Dwyer 1995; Slater 2003a; Zhong 
Jinwen 2007). There are many different designations for the area (Janhunen 20072), but 
the name Amdo Sprachbund has been adopted by several recent publications (e.g., Jan- 
hunen 2012c; Simon 2015; Sandman & Simon 2016). It is difficult to establish a clear 
boundary of the area, but it roughly encompasses eastern Qinghai, parts of northern 
Sichuan, and most of Gansu. The best overview of the area has been given by Janhunen 
(2007a; 2012c), according to whom the area is the result of a very unique interaction 
of Turkic, Mongolic, Tibetic, and Sinitic languages. However, historically the Tangut 
language, usually classified as Qiangic, as well as the probably Para-Mongolic language 
Tuyuhun were at some point also spoken in the area. Janhunen (2012c: 180ff.) mentions 
the following defining features of the area: SOV word order, suffixes or enclitics, case 
marking, verbal tense-aspect categories, converbs, postpositions, indefinite articles, per- 
spective marking (including loss of person marking in Turkic and Mongolic). From this 
brief list alone, however, this quite clearly does not qualify as a linguistic area at all. 
Most of these features are not only prevalent in adjacent areas, but are also extremely 
common worldwide. But it may be noted that, apart from strong interference between 
individual languages with each other (e.g., Sandman 2012; Simon 2015; Sandman & Si- 
mon 2016), there are also several creolized languages such as Gangou, Wutun, Tang- 
wang, and Hezhou Chinese, which indicates strong language contact in the area (85.9). 
Before this background it seems even possible to extend the area towards the south to 
include, for example, the language Daohua, which is a Chinese-Tibetan creole or mixed 
language spoken in western Sichuan province (e.g., Acuo Yixiweisa 2001; Chen Litong 
2017). Nevertheless, the traditional conception of the Amdo Sprachbund is adopted here 
for pragmatic reasons (see $6.3). 
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Before Chapter 5 investigates the grammar of questions in the individual language fam- 
ilies of NEA, this chapter describes the most important parameters of the typology. 


4.1 Introduction to the typology of questions 


There is a certain amount of confusion surrounding the terminology employed for what 
was called the grammar of questions in this study. Grammar books usually employ the 
terms question and interrogative (nominal or attributive), but quite inconsistently so. In 
most cases no clear-cut distinction is drawn and the terminology is simply tacitly taken 
for granted. A few examples should suffice to illustrate the extent of the problem in 
English-language publications. Schulze (2007: 250), for example, explicitly employs the 
term interrogativity for the cognitive side of the phenomenon and question for the linguis- 
tic form. A related terminology can be found in Rajasingh (2014: 103): “Interrogation is a 
semantic process of eliciting information by way of questioning” (my emphasis) Huddle- 
ston (1994: 411), on the contrary, “explores the relation between interrogative, a category 
of grammatical form, and question, a category of meaning.” (my emphasis) Dixon (2012: 
376) draws a distinction between different speech acts (e.g., questions) and grammatical 
categories (e.g., interrogative). Furthermore, for what is traditionally known as an inter- 
rogative pronoun he employs the much more fitting term interrogative word. In this study, 
the term question refers either to the formal side or is used as a cover term for both the 
formal and the semantic side taken together. The semantic side of questions will only 
be named interrogativity or interrogation if a clear distinction is called for. The so-called 
interrogative words, following Diessel (2003), will simply be called interrogatives in or- 
der to preserve a connection to the traditional term and to place at the same time an 
additional emphasis on their similarity to so-called demonstratives and on their possible 
special position in the language. 

For all we know, question-response sequences (Enfield et al. 2010) and, more generally, 
turn-taking in conversation (Stivers et al. 2009) provide a universal enchronic infrastruc- 
ture that allows a comparison of different languages with each other. Question-response 
sequences are usually accompanied by non-linguistic cues such as the gazing behavior 
of the questioner (Rossano et al. 2009) or head movements by the addressee such as a 
head shake. For practical purposes, this study concentrates on the first part of such se- 
quences exclusively, and must leave aside non-linguistic elements. While this omission 
will perhaps cause some eyebrow-raising among experts, such information can only be 
obtained through prolonged fieldwork and thus is only available in sufficient detail for 
very few languages worldwide (e.g., Levinson 2010). 


4 The typology of questions 


A full account of the historical development of the typology of questions lies beyond 
the possibilities of this study. In the following, I will only give a rough sketch with a 
focus on more recent advances. Apart from some isolated and mostly outdated studies 
(e.g., Bolinger 1957), the modern typology of questions by and large may be said to have 
started around 1970 with Ultan (1978), a cross-linguistic study based on a sample of 79 
languages (originally published in 1969), Moravcsik's (1971) investigation of polar ques- 
tions in 85 languages, and Danielsen (1972), based on a sample of about 60 languages. 
Since then, the field has made enormous advances that cannot be reviewed here in ev- 
ery detail. During the 1970s and 1980s there were relatively few important publications 
with long-lasting effects, such as a collection of papers on questions in seven languages 
in Chisholm (1984) and the study by Sadock & Zwicky (1985) (written around 1976 and 
1977) in the first edition of Language Typology and Syntactic Description. The number of 
works has been steadily increasing at an ever faster pace from the 1990s until today. By 
now there are several dozen important publications, not including studies on individual 
languages, the number of which has been growing even more rapidly. But surprisingly, 
the only investigation that may be said to represent something like a standard typology 
is Siemund (2001), which is a mere 19 pages long and by now over 15 years old. A some- 
what updated account by Kónig & Siemund (2007) can be found in the second edition of 
Language Typology and Syntactic Description. Perhaps the best general introduction to 
the typology of questions to date can be found in volume three of Dixon's (2012: 376- 
433) Basic Linguistic Theory. Table 4.1 gives a non-exhaustive overview of some impor- 
tant typological studies of questions since 1990, excluding investigations of individual 
languages and generative approaches. Few studies are based on a large sample and al- 
most all are unrepresentative of the languages of the world. Exceptions include Idiatov's 
(2007) lengthy investigation of 1850 languages and especially a series of high-quality in- 
vestigations with a sample of about 900 languages by Dryer (20131,k,j). Most studies only 
focus on specific details but do not cover the entire scope of the grammar of questions. 

There are many possible classifications of questions. For instance, Sanitt (2007: 439) 
draws a distinction between empirical ("questions whose presuppositions are undoubted 
or taken as axiomatic”) and theoretical questions (“all questions which are not empir- 
ical”). Sanitt (2011: 559) furthermore introduces a distinction between closed questions 
that “have definitive answers” (such as a riddle, see §4.4) and open-ended questions that 
“lead to other questions”. These distinctions may be useful for the philosophy of science 
(e.g., Meyer 1980), but to the best of my knowledge they are not relevant for a cross- 
linguistic investigation. 

The typology proposed in this study differs from many previous typological accounts 
that usually first drew a distinction between different question types, especially polar 
and content questions. A similar focus on polar and content questions can also be found 
in most grammar books and specialized descriptions of questions in individual languages. 
In contrast, the present study makes a primary distinction between (1) question marking, 
(2) interrogatives, and (3) other functional domains such as focus that can combine with 
question marking or interrogatives. Only a secondary distinction is made within the 
domain of question marking in different question types (Hólzl 2016b). Of course, this is 
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Table 4.1: Important typological studies of questions since 1990 


Authors Focus Languages Number 
Muysken & Smith 1990 interrogatives pidgins and creoles ca. 25 
Heine et al. 1991 interrogatives, hierarchy global, unrepresentative 14 
Lindström 1995 interrogatives global, unrepresentative 24 
Mushin 1995 interrogatives Australian languages 26 
Nau 1999 interrogative paradigms European, Australian ca. 20? 
languages 
Huang et al. 1999 general languages of Taiwan 7 
Siemund 2001 general global ca. 50? 
Bencini 2003 question marking, global, unrepresentative ca. 25? 
grammaticalization 
Diessel 2003 interrogatives global, unrepresentative 100 
Bhat 2004 interrogatives global, unrepresentative ca. 80? 
Idiatov & van der Auwera 2004 interrogatives global, representative ca. 350 
Hackstein 2004 Indoeuropean languages 2 
rhetorical question, 
grammaticalization 
Cysouw 2005 interrogatives global, unrepresentative ? 
Cysouw 2007 interrogatives, Pichis Ashéninca (Arawakan), ? 
grammaticalization global, unrepresentative 
Idiatov 2007 interrogatives (who, what) global, representative 1850 
König & Siemund 2007 general global, unrepresentative ? 
Schulze 2007 general global, unrepresentative ? 
Lichtenberk 2007 interrogatives Oceanic languages ca. 25 
Hagége 2008 interrogative verbs global, representative? 28 (217?) 
Mauri 2008 question marking, alternative global, representative ca. 60? 
questions 
Mackenzie 2009 interrogatives, hierarchy global, unrepresentative 50 
Rialland 2009 intonation African languages 145 
Stivers et al. 2009 turn-taking global, unrepresentative 10 
Journal of Pragmatics 42 general global, unrepresentative 10 
(e.g., Levinson 2010) 
Axelsson 2011 tag questions global, unrepresentative ca. 12? 
Miestamo 2011 polar question marking Uralic languages ca. 30 (200?) 
Dixon 2012 general global ca. 30? 
Hengeveld et al. 2012 interrogatives languages of Brazil 24 
Dryer 20131 [2005] polar question marking global, representative 884 
Dryer 2013k [2005] interrogatives, position global, representative 902 
Dryer 2013j [2005] polar question marking global, representative 955 
Luo Tianhua 2013 general languages of China 138 
Hackstein 2013 polar question marking, Indoeuropean languages ca. 10 
grammaticalization 
Dingemanse et al. 2013 the word huh global, unrepresentative 31 
Haspelmath & the APiCS interrogatives, position pidgins and creoles 76 
Consortium 2013b 
Haspelmath & the APiCS question marking pidgins and creoles 76 
Consortium 2013a 
Kohler 2013 question marking Ometo languages 6 
O’Connor 2014 interrogatives, position south American Indian 44 
languages 
Hólzl 2015e question marking global, unrepresentative 50 
Bowern et al. 2016 general hunter-gatherer languages 194 
Hólzl 2016b question marking global, unrepresentative ca. 20 
Kóhler 2016 general African languages 119? 
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not an altogether new endeavor. Similar ideas have already been formulated, for example, 
by Bhat (2004: 248-249) for content questions. 


The purpose of using these pronouns [interrogatives ~ indefinites] in such sen- 
tences is merely to indicate that the speaker lacks knowledge regarding a particular 
constituent. There are two other meanings that need to be expressed in constituent 
questions [CQ], namely (i) a request for information (interrogation) and (ii) restric- 
tion of that request to a particular constituent (namely the indefinite pronoun); 
these meanings are generally expressed, in these languages, with the help of addi- 
tional devices; for example, devices like the use of question particles or question 
intonation are used for denoting interrogation, whereas devices like the use of fo- 
cus particles or focus constructions are used for denoting that the interrogation is 
restricted to a particular constituent. (my square brackets) 


Bhat’s approach was the impetus for a primary separation of question marking from 
interrogatives and focus in this study. However, while Bhat concentrated exclusively on 
content questions, this study includes further question types such as polar, alternative, 
and focus questions. 

This typology excludes echo, rhetorical, and indirect questions. For the most part, re- 
search commonly known as “wh-movement” or “wh-fronting” (e.g., Cable 2007) will not 
play a dominant role within this study either. First of all, very few languages in NEA 
exhibit this phenomenon that can by no means be said to be a universal property of lan- 
guage. Second, it is, strictly speaking, neither part of the domains of question marking, 
nor of interrogatives, but belongs to the domain of focus marking. This study for the 
most part also excludes the grammatical category of indefinites that are usually derived 
from interrogatives and have been discussed in detail elsewhere (see Van Alsenoy & van 
der Auwera 2015 for a list of references). 


4.2 Question marking 


Typological variation within the domain of question marking includes (1) different for- 
mal marking strategies, (2) different semantic scopes of question markers over question 
types, (3) interaction of question marking with other functional domains, and (4) the 
overall number of question markers. But before I investigate these four aspects one after 
another, let me introduce the major question categories that can take question marking. 

The major question categories are polar questions (PQ) (also called yes-no questions), 
content questions (CQ) (also known as constituent or wh-questions), alternative ques- 
tions (AQ) (or disjunctive questions), and perhaps focus questions (FQ). The designations 
are somewhat problematic but nevertheless will be adopted here because they are the 
most common and conventional labels. Consider the following examples corresponding 
to the declarative sentence I want coffee (cf. Hélzl 2016b: 18): 
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(1) English 
a. Do you want coffee? 
b. Do you want coffee? 
c. Do you want coffee or tea? 
d. What do you want (to drink)? 


Generally, a different question category can be postulated if it has a specialized marking 
in at least one but preferably more languages. English alone, for example, fails to differ- 
entiate polar and focus questions in terms of question marking. Examples (1a) and (1b) 
exhibit the same question marking but differ with respect to the marking of focus. Focus 
is understood here in a very broad sense of a “restriction of that request to a particular 
constituent” (Bhat 2004: 246) and usually has a contrastive function. Alternative ques- 
tions, though exhibiting the same marking, in addition contain a disjunction preceding 
the second alternative, which is not the case in polar and focus questions. Now consider 
the following examples from the Na-Dene language Slavey. 


(2) Slavey (Hare, Na-Dene) 


a. sáduká ` 'aneh'j? 
Q like.this 2sc.do 


‘Do you do it this way?’ (Rice 1989: 1123) 
b. duká | nj 'aneh'j? 
like.this o 2sc.do 
‘Is this the way that you do it?’ (Rice 1989: 1133) 
c. legóhljnj gots’é  'awodee gushjikdé ‘awot’é? 
PN area.to 28G.OPT.go or home 2sc.oPr.be 
‘Are you going to Norman Wells or staying home?’ (Rice 1989: 1139) 


d. Slavey (Mountain, Na-Dene) 
‘amii ‘ayjla? 
who 3.affected.4 
‘Who did it?’ (Rice 1989: 1141) 


The examples in (2a) to (2d) are instances of polar, focus, alternative, and content ques- 
tions, respectively. Unlike English, there is a clear distinction between polar and focus 
questions. Polar, content, and alternative questions are generally accepted as separate 
categories, although alternative questions are sometimes subsumed under polar ques- 
tions (e.g., Siemund 2001). Focus questions (Kiefer 1980), on the other hand, are a category 
that is not usually recognized or included in grammatical descriptions, but nevertheless 
has some validity as shown in example (2b) above (see also Dixon 2012: 395-396). Mi- 
estamo (2011: 2) includes them in his definition of polar questions “encompassing all 
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interrogatives eliciting yes/no replies, regardless of whether they are neutral or biased 
towards a positive or negative answer, or whether they have [a] broad sentence focus 
or a more narrow focus on a particular constituent” The prime example for the cross- 
linguistic relevance of focus questions used in Hólzl (2016b) is the Japonic language 
Yuwan (see 85.6.2), which contains specialized question markers for polar (-mi), focus 
(-ui), and content questions (-u). Dixon (2012: 396) mentions yet another example with 
different polar (-ée) and focus question marking (-áa), the Cushitic (Afroasiatic) language 
Tunni. However, he also includes focus questions in the category of polar questions. Mi- 
estamo (2011) is certainly correct that in many languages focus questions exhibit affinities 
with polar questions, but this is not always the case. In the Australian language Bardi (3), 
for example, there is an affinity between focus and alternative, but not polar questions, 
which are marked with sentence-initial nganyji, which is derived from an interrogative. 


(3) a. Bardi (Nyulnyulan) 
gooyarr=arda aarli mi-n-Q-nya-gal? 
two=Q fish 2-rnR-[PsT]-catch-nEC.PsT 
"Was it two fish that you caught?’ 


b. ngay=arda nga.n.k.iid.a broome-ngan, gardi joo=warda? 
IMIN=Q go PN-ALL or 2MIN=Q 


‘Will I go to Broome today or will it be you?’ (Bowern 2012: 619) 


There do indeed seem to be relatively few languages with specialized marking strategies 
for focus questions, although this might be a distortion due to the fact that the cate- 
gory itself is not widely known in linguistic circles and therefore did not make it into 
grammatical descriptions. 

Following the methodology sketched above, there are some indications for additional 
types of questions. One such example are negative polar questions (NPQ). In most lan- 
guages, including English, these are expressed in the same way as plain polar questions 
except for the addition of a negator, e.g. Don’t you want coffee? But in Urarina, a lan- 
guage with no clear affiliations to other languages spoken in Peru, there is a specialized 
NPQ marker ta different from the plain PQ marker =na. An NPQ in addition requires a 
negative marker =ne. 


(4) a. Urarina 
hanone mésahe auna-i-pa? 
morning message hear-2=Q 
‘Did you hear the message in the morning?’ 
b. ta kure kwituku-i=pe? 
Q price know-2-NEG 
‘Don’t you know the price?’ (Olawsky 2006: 832, 834) 


Note the different syntactic behavior of the plain and the negative polar question mark- 
ers. This category seems to play no important role for NEA, however, which is why it 
has not been taken into account in this study (but see 85.6.2 on Shuri). 
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Minor subtypes of questions include negative alternative questions (NAQ, Do you want 
coffee or not?) and open alternative questions (OAQ, Do you want coffee or what?), but 
their cross-linguistic relevance remains to be investigated. NAQs are mentioned as a 
separate category because they play a very important role in the grammaticalization 
of polar question markers (§4.2.3). The category of open alternative questions has been 
proposed by Tolskaya & Tolskaya (2008). NAQs and OAQs, it seems, do not fulfill the 
requirement as an independent question type as they are generally based on the same 
construction as alternative questions. In this study the two are simply taken as useful 
labels for special subtypes of alternative questions. 

Table 4.2 lists some defining properties of the major question types described above. 
There may well be additional properties, but these are the most important ones for the 
purposes of this study. The term proposition, which is a very common and useful label, 
should not be understood in a logical sense, but in terms of embodied simulation (see 
§4.4). Polar questions and focus questions both expect an answer in the positive or in 
the negative. If they are marked with the same marker, this usually attaches to the verb 
in polar questions and to the focal element in focus questions. Very often, the same 
question marker can also be found in alternative questions, where it attaches once to 
each alternative. Focus questions and content questions share a narrow focus on one 
constituent. The difference lies in the fact that in content questions this usually is an 
interrogative, while in focus questions it is usually a fully specified nominal or verbal 
phrase or other part of the sentence. 


Table 4.2: Important properties of individual question types (cf. Dik 1997: 260; 
Dixon 2012; Hólzl 2015d; 2016b). 


Expectancy Questioned part Level of Specified 
from hearer of proposition specificity alternatives 

CQ specific part (or whole) schematic none 
information 

PO  confirmationor ` whole specific one 
negation 

FQ id. part id. id. 

AQ selection from part (or whole) specific (or two or more 
alternatives schematic) 


However, content questions do not necessarily contain an interrogative as can be seen 
in Wari', a Chapacuran language spoken in Brazil in which demonstratives fulfill the 
function of interrogatives (Everett & Kern 2007). The presence of interrogatives as a 
defining feature of content questions is also problematic otherwise. For instance, there 
are languages in which an interrogative develops into a polar question marker but is still 
identical to the interrogative. Content questions are better defined as questions that have 
a narrow sentence focus on a schematic subpart of the "proposition" and thus inquire 
about very specific information instead of a confirmation (cf. Dixon 2012). For instance, 
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Are you leaving tomorrow? (FQ) is much more specific than When are you leaving? (CQ). 
The level of specificity has been adopted from Arnheim (1969: 238) and Langacker (2008: 
19) and will be further described in §4.4. Open alternative questions are partly schematic. 

Polar questions are unique in inquiring about the whole, but there are some alter- 
native questions such as Is it raining or did someone leave the sprinkler on? (Sadock & 
Zwicky 1985: 179) and content questions of the REASON type (why?) that are similar in 
this regard. The answer to a question such as Why are you leaving? may either be a whole 
(It is going to rain.) or a partial clause (Because of the rain). In fact, in many languages 
interrogatives referring to the category of REASON are derived from interrogative verbs. 
In many instances in NEA these are converb forms of an interrogative verb meaning ‘to 
do what’, e.g. Manchu ai-na-me ‘what-v-cvB.1PFV’. Because verbs usually stand for the 
whole “proposition”, this is direct evidence that REASON interrogatives may also refer to 
the whole as well. But most alternative questions and content questions, as well as all 
focus questions, focus on a subpart of a given “proposition”. This is directly mirrored 
in overt focus markers that are often found in focus and content questions as well as 
the fact that the part of alternative questions that is identical in both alternatives (the 
unfocused part) may fall victim to ellipsis in one of the two alternatives in the majority 
of languages. 

Another difference concerns the number of alternatives that are specified (Dik 1997: 
260). Content questions imply many possible alternatives but specify none. All expected 
alternatives, however, have in common the schematic meaning just mentioned. In other 
words, an answer to When are you leaving? may be tomorrow, the day after tomorrow, 
July 4 etc. If the expectancy was wrong, the answer may also be different (e.g., No, I am 
not leaving at all.), but this is something that all question categories have in common 
(see 84.4). Polar and focus questions imply more than one alternative but have only one 
specified. Alternative questions by definition have more than one alternative and do not 
usually imply more (but see example 22 below from Mauwake). 

In some cases it is not easy to differentiate between types of question marking and 
types of questions. Two of the most difficult examples are tag questions and so-called A- 
not-A questions, both of which in previous studies have been treated as either a question 
type or marking strategy of polar questions (see Hólzl 2016b: 20). (5) and (6) are typical 
examples (based on own knowledge). 


(5 a. English 
You want coffee, right? 


b. You want coffee, don't you? 


(6) Mandarin 
WEA ES 78? 
ni yao bu yao he cha? 
2sc want NEG want drink tea 


‘Do you want to drink tea?’ 
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The solution proposed by Hoólzl (2016b) is to classify A-not-A as elliptic negative alter- 
native questions without overt marking, but juxtaposition of the two alternatives. From 
this point of view they are neither a marking strategy for polar questions, nor a ques- 
tion type on their own, but are a very special case of alternative questions (see also Clark 
1985). The category of tag questions was either not described or are simply non-existent 
in the majority of languages in NEA and therefore do not play a significant role within 
this study. But their presence in many languages from other parts of the world, such as 
Europe, means that they cannot be neglected. Problematically, many investigations of 
tag questions fail to define them properly. Furthermore, what is usually recognized as 
a tag question has a plethora of different meanings, which makes them extremely dif- 
ficult to define from a functional point of view (Mithun 2012: 2166f.). The fact that tag 
questions can be described in terms of a question tag in relation to a so-called anchor 
(Axelsson 2011) makes them unique among all of the question types mentioned above. 
In fact, the traditional category of tag questions usually consists of two sentences, which 
is why they do not, actually, qualify as question type at all. Perhaps tag questions thus 
have to be described in different terms. For reasons that will become clearer in §4.4, tag 
questions will be treated as a construction type located on a different level of analysis 
than the other question types. However, for practical purposes their formal properties 
will be briefly discussed with the other question types in §4.2.1. 

The following subsections address marking strategies (§4.2.1), the scope of question 
marking over different question types (§4.2.2), the interaction of question marking with 
other functional domains such as focus (§4.2.3), and finally, the overall number of ques- 
tion markers in individual languages (§4.2.4). 


4.2.1 Marking strategies 


Previous accounts of question marking are often restricted to the marking of polar ques- 
tions (e.g., Miestamo 2011; Dryer 20131j). This section takes a broader perspective and 
investigates question marking strategies in all question types, including, for the sake of 
completeness, the problematic category of tag questions. 

Miestamo (2011), in analogy to his earlier typology of negation, investigates the distinc- 
tion and symmetry between the marking of polar questions and declarative sentences. 
Some of the categories in Dryer (2013j) are likewise based on such a comparison. My own 
typology builds on these approaches and draws a broad distinction between marked and 
unmarked polar questions and declaratives, which defines the four different types shown 
in Table 4.3. 


Table 4.3: Marking of polar questions as opposed to declaratives (cf. Hólzl 


2016b: 21) 
unmarked PQ marked PQ 
marked declarative Type 1: Sanuma, Sheko Type 3: Crow, Sabané 
unmarked declarative Type 2: Yéli Dnye Type 4: English, Bengali 
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For Types 1 and 2 consider examples from the Ethiopian language Sheko (7) and Yéli 
Dnye (8), a language without clear affiliation spoken on Rossel Island. 


(7) Sheko (Omotic, Afroasiatic) 
a. n=māāk-ā-m. 
1sc-tell-put-IRR 
‘I will tell? 
b. n=maak-a? 
1sc-tell-put 
"Shall I tell?’ (Hellenthal 2010: 402) 


(8) Yéli Dnye 
yi kópu dé d:uu./? 
that.ANAPH thing 3.1MM.PST.PL make.PL 
‘He made it./?’ (Levinson 2010: 2743) 


Note that Yéli Dnye even lacks a distinction of intonation. Types 1 and 2 are exceed- 
ingly rare and are altogether absent from NEA, which is why they will be neglected 
in this study (see Kóhler 2013 for Type 1). Type 4 is by far the most frequent cross- 
linguistically, followed by Type 3. 

The interaction of overt question markers with intonation complicates matters, but 
this will be ignored for the moment. The South American language Sabané (9) and Ben- 
gali (10) illustrate Types 3 and 4, respectively. 


(9) Sabané (Nambikwaran) 
a. iney-i-ntal-i. 
fall-v-PRET.NEUT-DECL 
‘(S)he fell’ 
b. iney-i-ntal-a? 
fall-v-PRET.NEUT-Q 


‘Did (s)he fall?’ (Araujo 2004: 205) 


(10) Bengali (Indo-Iranian, Indo-European) 


a. tumitake ` cenó. 
2sG 3SG.OBJ know.2.PR.s 
“You know him? 


b. tumikitake ` cenó. 
2sG Q 3sG.0BJ know.2.PR.s 


‘Do you know him?’ (Thompson 2012: 200) 


The different morphosyntactic status of the marker is unimportant for this primary dis- 
tinction. 
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However, by considering Types 3 and 4 exclusively, there is a variety of different for- 
mal types of polar question marking (e.g., Siemund 2001; Miestamo 2011; Dryer 20131,j). 
Spoken language is one-dimensional. In order to signal certain information such as inter- 
rogativity, there are thus limited means available. We may simply modulate the phona- 
tion of the speech stream (intonation), change the order of elements in the speech stream 
(word order), or we may add material (morphosyntax). Among the elements that can be 
added are affixes, clitics, or free elements such as particles. These may stand either be- 
fore or after another element (prefixes vs. suffixes, proclitics vs. enclitics, preposed vs. 
postposed particles). The element with respect to which these question markers can be 
located may either be the whole sentence or a subpart such as the first constituent or the 
verb. Affixes are less free in their position than clitics and particles, and usually attach 
to the verb. 

Apart from some exceptions, intonation is not normally described in detail for lan- 
guages in NEA, if it is mentioned at all. Within this study it was impossible to remedy this 
unfortunate fact, but where possible some rough outlines are sketched (such as falling or 
rising intonation etc.). Intonation, although not universal, is certainly among the most 
important ways of marking questions cross-linguistically. However, in the majority of 
languages, intonation is combined with other markers. In Dryer's (2013j) sample of 955 
languages only 173 languages (about 18%) exclusively made use of intonation for polar 
question marking. In NEA the number is even lower (Chapter 6). Concerning the loca- 
tion and contour of question intonation there are no absolute universals (see Sicoli et al. 
2014: 4 and references therein). In fact, generalizations such as final rising intonation in 
polar questions are not true for individual languages like English (Couper-Kuhlen 2012), 
let alone from a cross-linguistic perspective. For example, Rialland (2009: 928) describes 
what she calls the lax question prosody found in a relatively large area of central and 
western Africa, which is generally characterized by “a falling pitch contour, a sentence- 
final low vowel, vowel lengthening, and a breathy utterance termination produced by 
the gradual opening of the glottis.” Because of the absence of reliable and good infor- 
mation on intonation, this study necessarily focuses on the material aspect of question 
marking. 

Question marking by word order change is almost entirely restricted to Western Europe 
and Indo-European languages (e.g., Hackstein 2013), and is extremely rare from a cross- 
linguistic perspective (Dryer 2013j). This is a feature of European languages that clearly 
differentiates them from the rest of Eurasia, including NEA. An example can be found 
in Finnish (11). 


(11) Finnish (Uralic) 
a. sä tuu-t. 
2SG come-2SG 
"You're coming: 
b. tuu-t sa? 
come-2sG 2SG 


‘Are you coming?’ (Miestamo 2011: 7, 12) 
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This seems to be a pattern that originates in Germanic languages from where it spread 
to some Uralic, Romance, and Slavic languages. Following Miestamo (2011: 12), one may 
assume an original second position enclitic marking questions as well as focus. Such 
markers normally attach to the fronted verb in polar questions and the loss of this marker 
quite naturally leaves the fronting of the verb to mark polar questions. We furthermore 
know that several Indo-European languages had a second position clitic or particle such 
as =ne in Latin (cf. §5.5.2). According to Miestamo this is also what happened in Finnish, 
which still preserves a second position clitic in other constructions. 


(12) Finnish (Uralic) 
tule-t-ko sinä? 
come-2SG=Q 2SG 


‘Are you coming?’ (Miestamo 2011: 12) 


Perhaps, Germanic had a second position clitic comparable to Gothic =u (Braune & Hei- 
dermanns 2004: 178) that was already lost in other Old Germanic languages. While the 
loss of the question marker is not actually attested for Germanic, it is for some other 
European languages such as the Uralic language Pite Saami. Wilbur (2014: 186-187, 244) 
notes that there used to be a second position question marker =gu(s) in Pite Saami that 
attached to a verb in polar questions and that almost entirely disappeared during the 
20th century. Today, polar questions are usually marked by verb-initial word order only. 
Of course, the development in languages such as Pite Saami may have been influenced 
by language contact as well. 

Many examples of different morphosyntactic markers can be found throughout this 
section as well as in Chapter 5, which is why no further examples will be given here. A 
rare strategy is the use of infixes such as in Koasati, a Muskogean language spoken in the 
US (cf. Dixon 2012: 384). In Koasati, questions may be “formed by infixing a glottal stop 
between the penultimate and ultimate syllables.” (Kimball 1991: 301) The Koasati question 
marker is a true infix -7- that can, but does not necessarily, coincide with a mopheme 
boundary (Kimball 1991: 302). Similarly rare are auxiliaries that are restricted to marking 
questions (Miestamo 2011: 4). One example stems from the Salish language Halkomelem, 
which has the auxiliary li-. This should not be confused with auxiliaries encountered 
in, but not restricted to, questions such as English to do, or with interrogative verbs 
such as to do what that are interrogatives and not question markers (e.g., Hagége 2008). 
According to Hyman & Leben (2000: 593), there are some languages in which questions 
can be marked with tones: 


In Hausa {Chadic, Afroasiatic}, a L is added after the rightmost lexical H in a yes/ 
no question, fusing with any pre-existing lexical L that may have followed the 
rightmost H (which is raised somewhat, as are any following L tones whatever 
their source). As a result, lexical tonal contrasts are neutralized. In statements, [kai] 
‘head’ is tonally distinct from [kai] ‘you [masculine]’. But at the end of a yes/no 
question, they are identical, consisting of an extra-H gliding down to a raised L. 
In Nembe {Ijoid, ?Niger-Congo}, a final lexical L becomes H in statements, and 
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a final lexical H becomes L in questions. Thus, L-L / LH contrasts such as [diri] 
‘book’ / [burt] ‘yam’ are neutralized as L-H in statements, but as L-L in questions. 
A similar case is found in Isoko {Atlantic-Congo, Niger-Congo}, where a final L 
marks positive questions, while a final H marks negative questions. This causes 
a final lexical L to remain L in a positively expressed question, while this final L 
becomes a LH rise in a negatively expressed question: [übi] ‘book’ / [übi] ‘book? 
[negative]’. (my boldface and braces) 


No example has been found in NEA for these last three types of question marking. 

Generally, it seems, the same question marking strategies as in polar questions can 
also be employed in other question types. However, this has not actually been investi- 
gated. Kónig & Siemund (2007: 292), for instance, argue that “alternative questions can 
be neglected since, at least from our current perspective, they do not seem to show any 
striking typological variation.” This general negligence of alternative questions may be 
partly due to the fact that in any given language they are known to be much less frequent 
than polar or content questions (Hoymann 2010: 2728). But Siemund (2001) and Kónig 
& Siemund (2007) are clearly wrong in their assessment that alternative questions do 
not exhibit any interesting variation to be discovered. On the contrary, they actually 
exhibit much more variation than polar questions because, in addition to the question 
marking strategies encountered above, they show interaction with coordination, have 
two or more possible loci of marking, and display interesting patterns of ellipsis that 
may affect the question marking. 

The simplest marking strategy is a mere juxtaposition of the two alternatives. How- 
ever, the two alternatives may still be marked with intonation patterns that are not al- 
ways specified. For instance, in Amis (13) each alternative is marked with “a leveling- 
rising-falling intonation pattern" (Huang et al. 1999: 650). 


(13) Amis (Nuclear Austronesian, Austronesian) 
ma-tayal kísu ma-füti? 
AG.FOC-work 2sG.NOM AG.FOC-sleep 


‘Are you going to work or sleep?’ (Huang et al. 1999: 651) 


So-called A-not-A questions, frequently encountered in MSEA, are perhaps best ana- 
lyzed as a subtype of this type of alternative question marking with an additional nega- 
tor. 


(14) Mon (Monic, Austroasiatic) 
kan hù’ kan? 


come NEG come 


‘Are (you) coming (or not)?’ (Clark 1985: 60) 


In other cases the two alternatives may be conjoined with the help of a disjunction. 
For example, Saisiyat (15) makes exclusive use of a disjunction, but lacks any further 
question marking, including intonation. 
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(15) Saisiyat (Nuclear Austronesian, Austronesian) 
nifo fam nyawa ?am fahoe?? 
2SG.GEN want cat or want dog 


‘Do you want a cat or do you want a dog?’ (Huang et al. 1999: 652) 


Some languages such as Finnish have a special interrogative disjunction (tai) that is 
not identical to the standard disjunction (vai) (e.g., Haspelmath 2007: 25). In other lan- 
guages there is no disjunction but a question marker, for example on the first alternative. 
Consider the following negative alternative question (16), which exhibits the same ques- 
tion marker found in polar questions. 


(16) Guiqiong (Qiangic, Trans-Himalayan) 
zo gutchien dzi le, me-dzi? 
3sG PN COPQ NEG-COP 
‘Ts (s)he a Guiqiong or not?’ (Jiang Li 2015: 305) 


In English (as in the translation of 16 above) the polar question marking strategy on 
the first alternative (in this case word order change) is combined with a disjunction, 
which appears to be a common European phenomenon. However, this is combined with 
a special intonation contour in English, which rises on the first and falls on the second 
alternative. In other languages, there is a question marker attached to the second alter- 
native. The following example (17) is also a negative alternative question. 


(17) Palula (Dardic, Indo-European) 
tu the phedil-u, ki na? 
2SG.NOM to  arrive.PFV-SG.M ?Q NEG 


‘Did you receive it or not?’ (Liljegren 2016: 404) 


This, again, may be combined with disjunctions. In other languages there are question 
markers on each alternative, with or without disjunction. Examples for these types can 
be found below such as in (21). Table 4.4 schematically shows some possible types of 
interaction of disjunction and question marking. Of course, it is simplified and does not 
show all possible marking strategies such as the use of intonation, particles, clitics, af- 
fixes etc. It merely schematically illustrates juxtaposition, single marking on the first or 
second alternative and double marking, all of which may combine with disjunctions. It 
becomes apparent that there are dozens of combinations of these patterns with different 
marking strategies, which makes it impossible to present them all in this section. Each 
type, furthermore, can interact with other domains such as negation. 

What is more, the plethora of different patterns in Table 4.4 above does not even 
cover all alternative question marking strategies found in the languages of the world. 
Khwarshi, for example, in addition to double marking, contains cases in which the dis- 
junction ya(gi), borrowed from Awar, is not employed once but twice (18). 
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Table 4.4: Schematic interaction of disjunction and question marking. 


1st alternative 2nd alternative 
X +/-DISJ Y 

X-Q +/-DISJ Y 

X +/-DISJ Y-Q 

X-Q +/-DISJ Y-Q 


(18) Khwarshi (Tsezic) 
me ya lo-k n-eq-se yalac’a-k ` l-i-se? 
2SG.ERG or water(IV)-o IV-bring-prs or food(IV)-o IV-do-prs 
"Will you bring water or make the meal?' (Khalilova 2009: 458) 


The language Edo (Niger-Congo) has a disjunction rà, either once between the two 
alternative, or twice following each alternative (Omoruyi 1988: 23). Additionally, the 
markers on the different alternatives are not necessarily identical as can be illustrated 
with data from Tshangla as spoken in Bhutan (19). 


(19) Tshangla (Trans-Himalayan) 
ser-ga rengan tang-pe mo, shing-ga rengan tang-pe ya? 
gold-Loc ladder bridge-ıNnF Q wood-roc ladder bridge-INF Q 
‘Should I put up a silver ladder or a wooden ladder?’ (Andvik 2010: 193) 


In Tshangla, mo is also a polar question marker and ya, which is optional in alternative 
questions, is also found in content questions. 

In some languages there is a complex expression meaning ‘(and) if not’ (20), which 
functions more or less like a disjunction but should be kept distinct as it is etymologically 
transparent. 


(20) +Akhoe Hailom (Khoe) 
uri ra ari-ba | tama-s ga i-o Jo ra ari-b.a? 
jump PROG dog-3sc.M NEG-3sG.F POT STAT-if walk PROG dog-3sG.M 


"Does the dog jump or does the dog walk?' (Hoymann 2010: 2733) 


Yet another dimension of variation concerns the number of alternatives. While it is 
true that the most typical alternative questions exhibit two alternatives, there are also 
examples with more than two, such as in (21). 


(21) Mauwake (Trans-New Guinea) 
no matukar ikiw-i-nan-i dylup-i e sarang? 
2SG PN gO-N.PST-FUT.2SG=Q PN-Q Or PN 


"Will you go to Matukar, Dylup, or Sarang?’ (Berghäll 2015: 310) 
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Mauwake usually has an enclitic =i at the first alternative and a disjunction before the 
second. When three alternatives are present, the first two take the enclitic. This example 
also illustrates that the question markers in the individual alternatives do not have to 
attach at the same place. When the set of possible answers is expected to be open, the 
construction differs slightly and the second alternative also takes the question marker 
(22). 


(22) Mauwake (Trans-New Guinea) 
matukar ikiw-i-nan-i e dylup ikiw-i-nan-i? 
PN gO-N.PST-FUT.2SG=Q Or PN ` gO-N.PST-FUT.28G-Q 
"Will you go to Matukar or Dylup (or perhaps neither)?’ (Berghäll 2015: 311) 


Some languages do not allow ellipsis of identical parts (e.g., the Austronesian language 
Rukai, Zeitoun 2007). All other languages allow some form of deletion. A very useful 
distinction that was introduced by Huang et al. (1999) for Austronesian languages on 
Taiwan is that between forward (analipsis, 23b) and backward deletion (catalipsis, 23c) 
(see also Haspelmath 2007: 39). 


(23) Mandarin (Trans-Himalayan) 

a. ni qüzhongguó (háishi) bú qu zhongguó? 
2sc go China oo NEG go China 

b. ni qüzhongguó (háishi) bú on ` ? 
2sc go China oro NEG go (China) 

c. ni qu— (háishi) bo qu zhongguó? 
2sG go (China) oro NEG go China 
‘Are you going to China or are you not going to China?’ (elicited, own 
knowledge, cf. Hólzl 2015e) 


In alternative questions the part that is not focused on may fall victim to ellipsis. In 
other words, (elliptical) alternative questions are somewhat similar to focus questions. 
This contrasts with the common assumption of alternative questions being related to 
polar questions, exclusively (e.g., Siemund 2001). 

Content question marking has not been investigated very often. Many languages have 
morphosyntactically unmarked content questions, but these may exhibit special intona- 
tion patterns that often are not clearly specified in the available descriptions. The remain- 
ing languages seem to make use of all the most common question marking strategies 
discussed above for polar questions and will thus be excluded here. Many examples can 
be found in $5. 

The marking of focus questions is difficult to investigate because most grammatical 
descriptions simply do not mention it. Most likely, they can exhibit more or less the 
same range of marking strategies as polar questions. Given their interaction with the 
domain of focus, they will be discussed further in §4.2.3 on the interaction of functional 
domains. Several examples can be found throughout Chapter 5. 
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Table 4.5: A typology of question tags according to Axelsson (2011: 803) 


invariant question tags - neutral no restrictions 
- polarity-biased 
- polarity-dependent 


variant question tags - lexically-dependent 1 
- marginal grammatically- dependent 
- grammatically- dependent most restrictions 


Tag questions have been excluded from the list of central question types in this study. 
Nevertheless, some information on their formal properties seem to be in order. Perhaps 
the best typology of tag questions has been given by Axelsson (2011: 803) (Figure 4.5). A 
main difference is drawn between invariant and variant tags. Invariant tags appear to be 
more common, both cross-linguistically and in NEA. Each is furthermore divided into 
three different subtypes. 

So-called neutral and polarity-biased question tags are neutral with respect to the 
polarity of the anchor, although the latter often prefers positive or negative anchors. 
Polarity-dependent question tags, as the name suggests, are restricted to either positive 
or negative anchors. Consider the following examples from English (24), where the first 
is a neutral (non-dependent) and the latter a grammatically-dependent question tag (own 


knowledge). 


(24) English 
a. You want coffee, right? 


b. You want coffee, don't you? 


Marginal grammatically-dependent question tags, on the other hand, “are cases where 
the use of a certain question tag is dependent on a certain grammatical feature in the 
anchor (other than polarity), but where there are no variable grammatical features in the 
tag itself” (Axelsson 2011: 805) In lexically-dependent question tags, a lexical element 
of the anchor is also found in the tag (Axelsson 2011: 805). There are relatively many 
languages in NEA for which no tag questions are attested. While at least in some cases 
this may be due to the lack of sufficient information, tag questions most likely are not a 
universal property of language. 

Another useful dimension of question tags that is somewhat less relevant for other 
question markers concerns its etymological transparency. German, for example, has a 
variety of tags, among which we find a form ge(lle) that is completely opaque from a 
synchronic perspective (25a). German richtig, on the other hand, is a common adjective 
related to English right (25b). Both are neutral question tags. 
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(25) German 
a. Du magst Kaffee, ge(Ile)? 
b. Du magst Kaffee, richtig? 


The meaning and word order are identical to the English sentence You want coffee, right? 
above (24a). In fact, most question markers are opaque from a synchronic perspective. 
Question tags, on the other hand, are frequently transparent. Question markers further- 
more tend to be extremely short (see 86.1.1). Question tags certainly can be short as well 
(e.g., English eh?), but generally tend to be longer and more complex than usual question 
markers (e.g., English isn't it?, Mandarin dui-bu-dui?). These properties underline their 
separate status. 

Mithun (2012: 2167) roughly differentiates between epistemic (e.g., informational, con- 
firmatory), and affective (e.g., facilitating, attitudinal, peremptory, aggressive) functions 
of tags. Axelsson (2011) crucially investigated only confirmation seeking (perhaps better 
called epistemic) question tags, which reduces the problem of their classification consid- 
erably. The typology correctly excludes confirmation seeking constructions that are not 
formally tag questions (Axelsson 2011: 796). Hadiyya (26), for example, has a confirma- 
tion seeking suffix Jo, which combines with the polar question marker -nni(yye). 


(26) Hadiyya (Cushitic, Afroasiatic) 
kaa ii diinate mass-i-t-aa-tto-lla-y yo-nni? 
2sG.voc 1sc.cEN money.Acc take-E-2SG-PRS.PFV-2SG-CONF-NEG-Q 


"You have taken my money, haven't you?' (Sulamo 2013: 27) 


Given the fact that the question is one single sentence, it is better classified as a spe- 
cial kind of polar question. $4.4 elaborates on the classification of tag questions. Non- 
epistemic uses are likewise excluded from this study. 


4.2.2 The scope of question marking 


While different marking strategies for questions are well-known, it is usually not recog- 
nized that these differ in their semantic scope over different question types (but among 
others see Dixon 2012: 389-390 and especially Hólz] 2015e; 2016b). Given the lack of in- 
formation for NEA, this study will make use of a limited conceptual space shown in 
Figure 4.1 that only includes the most central question types. As can be seen, polar ques- 
tions take a central position while other types—especially content questions—have a pe- 
ripheral position. Solid lines indicate the possibilities that two categories may be marked 
with the same marker. The semantic scope of a given marker may be shown as a closed 
line that encloses those categories that may be marked by it (i.e. its semantic scope). 

There is one possible implicational universal that needs further testing in other parts 
of the world but seems reasonably robust for now. 


(27) Content questions are only marked in the same way as focus or alternative 
questions if polar questions are also marked in the same way. 
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The universal is represented on the conceptual space as the lack of connecting lines 
between categories (Figure 4.1). These presumably impossible connections are given as 
dashed lines. Note that this is an example for the so-called Semantic Map Connectivity Hy- 
pothesis: “any relevant language-specific and/or construction-specific category should 
map onto a connected region in conceptual space” (Croft 2003: 134). A possible coun- 
terexample from Tshangla, which allows the use of the content question marker ya in 
alternative questions, can be found in §4.2.1. 


AQ 


Figure 4.1: Limited conceptual space of question marking 


Another possible implicational universal concerns the dashed line between focus and 
alternative questions (Hólzl 2015e). 


28) Focus and alternative questions can only be marked in the same way if polar 
q y yup 
questions are also marked in the same way. 


Only one possible exception (the Nyulnyulan language Bardi spoken in Australia, see 3 
above) was found within the global 50 language sample investigated by Hólzl (2015e). An 
obstacle for confirming or disproving the universal is severely hampered by the lack of 
adequate data for the majority of languages. The dashed lines are also meant to indicate 
that such connections might be possible after all but clearly are dispreferred. 

If the conceptual space is universally applicable, which should be the long-term goal, 
then it poses several powerful constraints on how markers can expand their scope. An 
extension of the semantic scope of a given marker, for example, is only possible if there 
is a connection in conceptual space. Every language shows a distinctive semantic map, 
but languages may have similar patterns due to universals, tendencies, chance, language 
contact or common inheritance. Given that question markers are often and freely bor- 
rowed from one language to the next, semantic maps easily change their shape. 

Content questions, which often remain unmarked morphosyntactically, are a special 
case. By comparing polar and content questions and further differentiating between mor- 
phosyntactically marked versus unmarked content questions, one gets a matrix of four 
language types (Table 4.6). 

Type 4 appears to be the most and Type 3 the least common, cross-linguistically.! In 
sum, there is a deep bifurcation between content questions on the one hand and polar 


1Given the lack of information Hólzl (2015e) omitted intonation, which should be included in future studies. 
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Table 4.6: Polar and content question marking strategies among 50 languages, 
based on Hólzl (2015e); the classification of three languages remained unclear 


CQ marked CQ unmarked 
PQ=CQ Type 1:9 Type 3: 1? 
PQ#CQ Type 2:9 Type 4: 28 


questions on the other. However, as we will see more clearly in Chapter 6, polar questions 
have closer relations to the other question types. 


4.2.3 Interaction of functional domains 


The term functional domain here covers broad universal categories such as negation, 
focus, or question marking, which themselves have many subcategories. Hólzl (2016b: 
24) distinguished between four different types of interaction between such functional 
domains shown in (29). 


(29) a. grammaticalization (1) 
b. combination (2) 
fusion (3) 


d. interaction (split types) (4) 


o 


For practical purposes, the combination of disjunction with question marking was al- 
ready covered above in 84.2.1. 

(1) Grammaticalization in this context is understood as a cover term for the shift in 
meaning of a linguistic element from one functional domain to another. Many details, of 
course, are language- and construction-specific, but here only a cursory overview similar 
to the World Lexicon of Grammaticalization (Heine & Kuteva 2002) can be given (cf. Hólzl 
2015e). Consider the following polar question from a language in Nepal (30). 


(30) Bantawa (Kiranti, Trans-Himalayan) 
am-k'e ham-si — ti-k"ar-a-?0? 
2sG.GEN-lice swap-sUP 2AS-GO-PST-Q 


‘Did you go to swap lice?’ (i.e. ‘Did you go to have sex?’) (Doornenbal 2009: 205) 


The marker -7o has been glossed as a question marker, but it is really a nominalizer, which 
is presumably the reason why the example has an additional semantic component ‘is it 
the fact that'. A similar development has also been described for Tucanoan languages in 
South America, which 


exhibit a historical and semantic relationship between nominalizations and ques- 
tions. We have also tried to demonstrate that formally the latter originate from 
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the former through a process of upgrading a nominalized predication to the status 
of an independent utterance from an inferential or mirative construction. Seman- 
tically, the interrogative meaning must have become conventionalized via stages 
expressing doubt or surprise. (van der Auwera & Idiatov 2008: 46) 


Whether exactly the same developmental path was followed in Bantawa or other lan- 
guages with this phenomenon is not known to me. 

Two other well-known examples are the development of disjunctions and negators 
to polar question markers. However, both of these developments usually start within 
the context of an elliptic alternative question. In some languages such as Edo (31), the 
second alternative is fully elliptic and the disjunction can take over the function of a 
polar question because no second alternative is specified (cf. Dixon 2012: 399-400). 


(31) Edo (Niger-Congo) 
a. Osáró bó ` ówá rà Osáró rhié okhüo? 
PN build house or PN take woman 
‘Did Osaro build a house or did he marry a woman?’ 
b. Osáró bó | ówá rà? 
PN build house o 
‘Did Osaro build a house?’ (Omoruyi 1988: 22, 23) 


Similarly, negators can develop into polar question markers in negative alternative 
questions when the second alternative only consists of the negator. Examples of this 
sort can be found in Mandarin (§5.9.2.1), for instance. A related development seems to 
start from negative alternative questions as well, but in this case the first alternative 
appears to have been deleted. In Kham (Trans-Himalayan), for example, the prefix ma- 
can express both negation and polar questions (Watters 2002: 96-101). Negators such as 
German nich(t) ‘not’ can also develop into question tags. 

Yet another frequent development is from interrogatives to polar question markers 
and question tags. This development is very rare in NEA but many examples can be 
found in Indo-European languages (85.5.2, Hackstein 2013: 100). Example (10) from Ben- 
gali above, for example, contains the polar question marker ki, which is most likely de- 
rived from the interrogative ki ^what' (Ihompson 2012: 200-203), see also (17). In the 
language Palula the interrogative ga ‘what’ developed into a question tag (32). 


(32) Palula (Dardic, Indo-European) 
so guum ga? 
3SG.NOM go.PFV.SG.M what 
‘He left, didn’t he?’ (Liljegren 2016: 404) 


This development can also be found in other languages of South Asia. For instance, the 
Dravidian language Kurux employs the interrogative ender what as a question marker 
in sentence-initial position (Kobayashi & Tirkey 2017: 241-242). Another example men- 
tioned above stems from Bardi. §6.1.3 summarizes the most important grammaticaliza- 
tion paths found during this study (see also Bencini 2003). 
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(2) Question marking is frequently combined with interrogatives in content questions 
and disjunctions in alternative questions. Interrogatives (~ indefinites) are almost univer- 
sal, but there are many languages without disjunctions, for example in northern NEA. 
Another special case concerns focus markers that are frequently present in focus and 
sometimes other question types (Figure 4.2). In English, for example, focus questions are 
expressed by usual polar question marking and additional intonational focus or a cleft 
construction (33). 


(33) English 
a. Did you go there? 
b. Is it you who went there? 


In both cases focus and question marking are merely combined with each other. For 
practical purposes disjunctions and focus marking will be treated together with question 
marking in this study, but one should keep in mind that they really belong to different 
functional domains that merely overlap with each other. 


disjunctions, focus 


AQ 


PỌ co 


focus interrogatives, focus 


Figure 4.2: Typical interaction of question marking with other functional do- 
mains 


Previous studies of question marking have presumably focused on polar questions, 
because these exhibit the least interference with other functional domains. 

(3) In instances of fusion, on the other hand, a question marker also has additional 
functions such as focus marking. When a question marker also functions as a focus 
marker, it usually attaches to the verb in polar questions and to the focal element in focus 
questions. Such an example can be found in the South American language Quechua as 
spoken in Cusco (34). 


(34) Cusco Quechua (Quechuan) 


a. wasi-y-mar hamu-nki=chu? 
house-1-DAT come-2=Q.FOC 


‘Do you come to my house?’ 
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b. wasi-y-man=chu hamu-nņki? 
house-1-DAT=Q.FOC come-2 


‘Do you come to my house?’ (Ebina 2011: 29) 


See §6.1.3 for a list of examples from NEA. 

(4) The most complex question marking systems are split systems. In such languages 
the choice between different question markers depends on other domains such as person, 
number, tense, aspect, mood, evidentiality, clause type etc. §6.1.3 lists all instances found 
in NEA. A relatively simple example can be found in the language Qiang (35), which has 
a split based on person. 


(35) Qiang (Qiangic, Trans-Himalayan) 
a. fi zme nua-n-a? 
2SG PN COP-2SG-Q 
‘Are you a Qiang?’ 
b. the: zme nua-O-nua? 
3SG PN COP-(3sG)-Q 
‘Ts (s)he a Qiang?’ (LaPolla & Huang Chenglong 2003: 180) 


Only second person singular forms take the marker -a instead of -jua. Many examples 
of split types exhibit instances of fusion, but this is not necessarily so, as this example 
illustrates. An example for a split in combination with fusion stems from the Amazonian 
language Kulina (36), which combines question marking with gender. 


(36) Kulina (Arawan) 
a. osonaa-ko? 
PN-Q.M 
‘Is he a Kashinawa?’ 
b. osonaa-ki? 
PN-Q.F 
‘Is she a Kashinawa?' (Dienst 2014: 193) 


The markers appear in both polar and focus questions. Omotic languages (Afroasiatic) 
exhibit some of the most complex split systems (see Amha 2007; 2012; Hellenthal 2010: 
401ff.; Kohler 2013; 2016; Treis 2014; Hólzl 2016b: 26 and references therein). Again, see 
§6.1.3 for those split types encountered in NEA. 


4.2.4 The number of markers 


A dimension not mentioned in Hólzl (2016b) is the sheer amount of question markers 
present in a given language. Arguably, this is yet another dimension of the complexity of 
the grammar of questions. There is a certain connection with both the scope of question 
marking and the interaction with other functional domains. A smaller scope of question 
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markers is usually, but not necessarily, correlated with a higher number of markers. The 
question marker =Ku in the Tungusic language Evenki, for example, has a broad scope 
that covers polar, focus, and alternative questions, and, indeed, Evenki has only a rather 
small amount of other question markers that also depend on the dialect, however. If ques- 
tion marking interacts with certain other domains such as person marking, there tends 
to be a higher number of markers. The average number and possible variation among 
the languages of the world is not entirely certain but presumably most languages have 
at least one or a few question markers. It is, furthermore, not self-explanatory how ques- 
tion markers should be counted at all. For instance, the Tungusic language Manchu has 
a question marker =ni that fuses with certain words such as the negative existential aki 
to yield a complex form akin. Should =ni and -n be counted as one or two markers? 
Despite such problems, it is usually unproblematic to establish whether a certain lan- 
guage exhibits a larger or smaller amount of markers relative to other languages. The 
Nicobarese (Austroasiatic) language Must, for example, according to Rajasingh (2014: 
114), only has one question marker, namely final rising intonation. The Cushitic (Afro- 
Asiatic) language Hadiyya, to give a slightly more complex example, has three main 
question markers, rising polar question intonation, the polar and alternative question 
marker -nni(yye), and the confirmation seeking suffix -lla that is usually combined with 
-nni(yye) (Sulamo 2013). The majority of languages in NEA and worldwide seem to clus- 
ter somewhere around this relatively small amount of question markers, but there are 
some languages with extremely complex question marking systems (e.g., §5.4.2 on Yupik, 
§5.9.2.1 on Sinitic, and §5.14.2 on Yukaghiric). Perhaps the upper end is formed by Omotic 
(Afroasiatic) languages, which sometimes exhibit a plethora of several dozen different 
forms organized in many different paradigms (e.g., Amha 2007; 2012; Hellenthal 2010: 
401ff.; Kóhler 2013; 2016; Treis 2014; Hólzl 2016b: 26, and references therein). 


4.3 Interrogatives 


What will simply be called interrogatives here has variously been termed wh-words, inter- 
rogative pronouns, interrogative words, question words etc. But these terms are problem- 
atic from several perspectives. First, wh refers to an English language writing convention 
exclusively (variously pronounced /h/ - /w/), even fails to capture English forms such as 
how, and has no validity whatsoever from a typological perspective. Interrogatives are, 
furthermore, not necessarily pronominal. Instead, they represent what has been called “a 
meta word-class, spanning a number of major classes" (Dixon 2002: 80) or “a pan-basic- 
word-classes word class" (Dixon 2012: 409). As we will see during this section, there 
are interrogative nouns, verbs, adjectives, adverbs etc. The terms question word or inter- 
rogative word are, therefore, more adequate than the other terms but still problematic. 
While pronoun suggests a connection with grammar, word clearly indicates a lexical cat- 
egory. It has been shown by Diessel (2003), however, that interrogatives (and perhaps 
demonstratives), do not clearly belong to either of these categories. Diessel (2003: 636) 
is certainly right in his view (also accepted by Cysouw & Hackstein 2011) that 
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while grammatical markers are commonly derived from lexical expressions, demon- 
stratives and interrogatives cannot be traced back to lexical items. While both are 
often reinforced by other lexemes, there is no evidence from any language that a 
new demonstrative or interrogative developed from a lexical source (unless the lex- 
ical source first functioned to reinforce a genuine demonstrative or interrogative). 
All this suggests that demonstratives and interrogatives have a special status in lan- 
guage and should be kept separate from genuine grammatical markers. 


Like lexical items, both demonstratives and interrogatives are often the source for several 
grammatical items. In a brief discussion on Funknet, Heine & Kuteva (p.c. 2018) made 
me aware of the fact that there are indeed several examples of demonstratives with lex- 
ical origins. Nevertheless, interrogatives and perhaps demonstratives might still form a 
class by themselves that is neither, strictly speaking, lexical, nor grammatical. “Gram- 
matical markers organize the information flow in the ongoing discourse, whereas basic 
demonstratives and interrogatives are immediately concerned with the speaker-hearer 
interaction.” (Diessel 2003: 635) Interestingly, the two often share paradigmatic similar- 
ities (see below) and the only known language without interrogatives, the Chapacuran 
language Wari’, uses demonstratives instead (Everett & Kern 2007). 

There are several imaginable typologies for interrogatives, but many of them do not 
make too much sense from a cross-linguistic perspective. For example, one might count 
the number of forms that may be encountered in one language. The number of interroga- 
tives among languages is highly variable. There are none in Wari’ (Everett & Kern 2007) 
but up to about 30 in German according to my count, including derived forms. How- 
ever, apart from the practical problem that almost no grammatical description mentions 
more than a handful of forms, it is by no means clear how such forms should actually 
be counted. Mackenzie (2009: 1133), for instance, counts “only the simple forms as true 
interrogative forms". Similarly, Hengeveld et al. (2012: 46) only include “basic question 
words". The necessary condition for these claims is a clear-cut boundary between forms 
that can be analyzed and those that cannot. However, the existence of such a boundary is 
far from clear because analyzability is clearly “a matter of degree” (Langacker 2008: 352). 
Let me illustrate this with the help of interrogatives in the Tungusic language Manchu 
(§5.10.3). There certainly are some non-analyzable “basic” interrogatives such as we ‘who’ 
that even historically are not transparent. Then there are forms such as atanggi ‘when’, 
which is not analyzable synchronically but shares a resonance (a submorpheme) a- with 
several other forms. In all likelihood it is ultimately based on the interrogative ai ‘what’, 
but the derivation remains unclear, since a word meaning perhaps ‘time’ with this form 
is not attested. Mackenzie (2009) and Hengeveld et al. (2012) would perhaps include both 
of these forms into the category of “basic question words”, but this is an arbitrary deci- 
sion. Manchu furthermore has a form aiseme ‘why’ that clearly is a combination of ai 
‘what’ and the quotative seme, which in turn may be analyzed as se-me ‘say-CVB.IPFV’. 
Despite its formal analyzability, the semantic side is not fully compositional. Further 
problems for an analysis are so-called cranberry morphs such as the second element 
in Manchu ai-bi-de ‘where’, which stands opposed to the fully-analyzable form ai-ba-de 
‘what-place-Loc’. Manchu simply has no independent form bi that would explain the sec- 
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ond element in ai-bi-de. It is not the first person singular nominative bi, nor the copula bi 
which cannot take any case markers. The most likely scenario is an idiosyncratic develop- 
ment from ba ‘place’. In any case, the point is that there are partly analyzable forms that 
constitute a scale between non-analyzable and fully-analyzable forms (see also Cysouw 
2005). This background also means that reconstructions of clear interrogative "stems" 
for any given proto-language in most instances must in principle be considered prob- 
lematic. While the analyzability of interrogatives tends to decrease over the course of 
time if no new forms are built, there may also be a development in the opposite direction, 
as witnessed in the reanalysis of German wor.um 'around where, about what' as wo.rum, 
which allows a reconnection to the word wo ‘where’ that historically lost the final -r (PIE 
*k"ór) and the creation of a new form rum. 

There are several more possible dimensions for a typology of interrogatives. Some in- 
vestigations (Heine et al. 1991; Peyraube & Wu 2005; Mackenzie 2009; Hengeveld et al. 
2012) have combined several of these dimensions (e.g., analyzability, polysemy, length) 
into one typology. However, the results that take the form of a hierarchy are simply not 
valid from a cross-linguistic perspective (Hólzl 2015c). A study mostly neglected in later 
typologies (but see Peyraube & Wu 2005) has been conducted by Heine et al. (1991), who 
investigated what they called *metaphorical relations" and how they related to interrog- 
atives in 14 different languages. Their result is a hierarchy that has the following form 


(37, slightly adapted): 
(37) PERSON « THING < ACTIVITY < PLACE < TIME < MANNER < PURPOSE/CAUSE 


According to their study, the first four categories on the hierarchy showed minimal 
phonological and morphological complexity and were often monosyllabic. TIME and 
QUALITY were slightly more complex. PURPOSE and CAUSE were found to be much more 
complex and often had the form “what-case”. Furthermore, in the languages investigated, 
THING and ACTIVITY were claimed not to be differentiated (e.g., English what, (to do) 
what). They had several interesting conclusions such as the following: 


While it remains unclear what the exact correlations between the linguistic and 
the cognitive structure of pronouns are, a few assumptions may be tentatively for- 
mulated. First, the relative degree of morphological complexity that a pronoun 
exhibits is likely to correlate to some extent with the relative degree of its cog- 
nitive complexity. [...] Second, formal similarity between different pronominal 
categories may be indicative of some kind of conceptual relation between these 
categories. (Heine et al. 1991: 59, my boldface) 


Let us now address an interesting typology by Mackenzie (2009), who, strangely, did 
not mention the study by Heine et al. (1991). He investigated interrogatives in a sam- 
ple of 50 languages. More specifically, he concentrated on so-called "cognitive complex- 
ity”, which may be accessed through an investigation of system complexity (extent of 
polysemy), item complexity (extent of analyzability), and signal complexity (number of 
phonemes, length), all of which were also included by Heine et al. (1991). The result of 
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his study also takes the form of a hierarchy which has the following form (38, put into a 
comparable format): 


(38) PERSON/THING < PLACE < TIME < MANNER < QUANTITY < CAUSE 


The major difference is that Heine et al. (1991) included activity instead of QUANTITY. In 
fact, Mackenzie (2009: 1150) himself noted that “none of the central hypotheses has been 
fully vindicated”. In my eyes, the main problem is the combination of different typolog- 
ical dimensions that are not directly connected (such as polysemy and length) and thus 
simply lead to inconclusive results. Mackenzie (2009) furthermore made some minor but 
unimportant mistakes such as counting letters instead of phonemes for Mandarin and 
including expressions about the time of day into the category of TIME. 

A follow-up study of Mackenzie (2009) was conducted by Hengeveld et al. (2012), who 
proposed a hierarchy based on “basic question words" Ge, non-analyzable interroga- 
tives). 


(39) PERSON/THING < PLACE < MANNER < QUANTITY/TIME/REASON 


Mackenzie (2009), who also investigated this problematic category, found the following 
slightly deviating hierarchy: 


(40)  PERSON/THING < PLACE < QUANTITY < MANNER < TIME < REASON 


However, the idea of a cross-linguistically valid hierarchy of “basic question words” 
has to be refuted, too (Hólzl 2015c). For example, Tungusic data result in the hierarchy 
shown in (41) (see $5.10.3): 


(41) PERSON/MANNER/QUANTITY < TIME « THING/REASON « PLACE 


As can be seen, there are severe problems such as the completely different location of 
PLACE on the hierarchy. In other words, such a hierarchy simply does not make sense 
from a cross-linguistic perspective. There is no reason to assume that a one-dimensional 
construct is capable of capturing the much more complex phenomenon of interrogatives. 
There might be some exceptions such as the frequency of certain interrogatives across 
languages that could converge to a certain degree, but this has not been investigated and 
turned out to be impossible to investigate for NEA due to lack of sufficient data for al- 
most all languages. It is also possible to investigate the mere length of interrogatives (e.g., 
German wer ‘who’ is shorter than warum why), but there does not appear to be a univer- 
sal hierarchy either (Hólzl 2015c). At least there may be a tendency for some categories 
(e.g., ‘who’, ‘what’) to have shorter forms than others (Mackenzie 2009: 1139), but this is 
not exclusively connected with the overall frequency in texts. For instance, the shortest 
interrogative in the Tungusic language Nanai is ui ‘who’, which is much less frequent 
than xooni ‘how’ (Kazama 2007: 320). Furthermore, there may be some convergence in 
the order in which interrogatives are learned by children during language acquisition. 
Previous research indicates a hierarchy of the following sort (Tomasello 2003: 159 and 
references therein, 42, slightly adapted). 
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(42) THING/PLACE < PERSON < MANNER/REASON < TIME 


However, the hierarchy is based on only a handful of languages and there is insufficient 
data for most languages in NEA. 

The following will address the (1) semantic scope (§4.3.1), (2) word class membership 
(§4.3.2), (3) diachrony (§4.3.3), (4) inflectional properties (§4.3.4), and (5) the connection 
of interrogatives to demonstratives (§4.3.5), which, for the purposes of this study, seem 
to be the most important dimensions for a typology. 


4.3.1 Semantic scope of interrogatives 


For the illustration of differences in the semantic scope of interrogatives consider example 
43 from the language Kusunda, a language without clear affiliation spoken in Nepal, and 
their English translations. 


(43) Kusunda 


a. noti na? 
INT this.AN 


‘Who is this?’ 


b. nati ta? 
INT this.INAN 


‘What is this?’ (Watters 2006: 48) 


The two categories of PERSON and THING are expressed with two different interrogatives 
(who and what) in English but with one (nati) in Kusunda. Thus, there is a difference 
in semantic scope of the interrogatives over different semantic categories. Usually, a 
narrow semantic scope goes along with a larger number of interrogatives and vice versa. 
In these examples, animateness in Kusunda is expressed by the demonstratives instead. 
As Cysouw (2005; 2007) has shown, this particular polysemy (PERSON=THING) is rare 
worldwide but relatively common in South America. In Eurasia it can also be found in 
Baltic and Tocharian B. 

The determination of the semantic scope of a given interrogative presupposes a fixed 
set of semantic categories. However, there is a certain dispute as to how many differ- 
ent categories should be postulated. The comparison in Table 4.7 is not exhaustive, but 
sufficient for our purposes (see also Mushin 1995 etc.). 

There is no agreement in terminology or number of different categories. This study 
follows Cysouw’s (2005) approach but adds additional categories. Strangely, only Heine 
et al. (1991) include the categories of ACTIVITY and PURPOSE, of which at least the first is 
rather crucial from a cross-linguistic perspective, and only Diessel (2003) mentions spa- 
tial interrogatives with an allative or ablative meaning. There are, furthermore, many 
more categories that are not included in the list, but the most prominent ones are cer- 
tainly represented. One category that should perhaps be added is kinp, which might 
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Table 4.7: A selection of different categorizations of interrogatives (Hólzl 2015c) 


Muysken & Heine etal. Diessel 2003 Cysouw Mackenzie Dixon 2012 

Smith 1990 1991 2005 2009 

who PERSON person PERSON ?individual who 

[+/-hum] 

what OBJECT thing THING what 

S ACTIVITY g - S - 

which a - SELECTION - which 

- S amount QUANTITY quantity how many/ 
how much 

why CAUSE - REASON reason why 

- PURPOSE : - - - 

how QUALITY manner MANNER manner how 

where SPACE place PLACE location where 


- - direction:to - - - 
- - direction:from - - - 
when TIME time TIME ?time when 


have been overlooked because English what kind of and similar forms in other Euro- 
pean languages is fully analyzable and thus appears non-basic. Nevertheless, this cate- 
gory has to be distinguished from the category of SELECTION, e.g. English which (one), 
which does not classify but individualizes a given referent. Thus, this study tentatively 
distinguishes the categories of PERSON, THING, SELECTION, ACTIVITY, CAUSE, MANNER, 
QUANTITY, PLACE, TIME, and KIND. Some of these have secondary subcategories such as 
COUNT (how many) or mass (how much) in QUANTITY and LOCATION (where), DIRECTION 
(whither), and source (whence) in PLACE. The category PURPOSE will not be distinguished 
from CAUSE as it does not appear to play a crucial role for languages in NEA. The same 
is true for the difference between MANNER and QUALITY. There are some additional cate- 
gories, but including them is not absolutely necessary because only a handful of forms 
is attested for most languages in NEA. There are a number of subcategories that will 
not be addressed any further. Pite Saami (Uralic), for example, apart from the selective 
interrogative mikir- ‘which’ has a special interrogative gab- ‘which one (out of two) (sc), 
which two (PLY (Wilbur 2014: 123). 

The question “What is your name?’ (see Idiatov 2007) often allows the use of two 
different interrogatives, ‘who’ and ‘what’. In some languages (e.g., 44) both interrogatives 
may be used. 


(44) Abui (Timor-Alor-Pantar) 


a. a-ne nala? 
2sG.INAL-name what 


"What is your name?' 
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b. a-ne maa? 
2sG.INAL-name who 


‘What is your name?’ (Kratochvil 2007: 129) 


Thus, there is no absolutely clear-cut or at least a language specific boundary between 
the categories of PERSON and THING. Similar problems exist for other categories such as 
MASS versus COUNT. 

Interrogatives have what can be called schematic (e.g., Langacker 2008) meaning and 
they express basic semantic categories (e.g., Schulze 2007). Direct evidence for the basic 
meaning of interrogatives can be found in many languages that have transparent inter- 
rogatives (Muysken & Smith 1990) such as English what kind of or what for. A list of 
frequent elements that are combined with interrogatives can be found in Table 4.8. For 
example, the Trans-Himalayan language Anong has a rather general interrogative k^? 
~ Hal that, if combined with a personal classifier, forms the interrogative Kail. jo? 
‘who’ (Sun Hongkai et al. 2009: 73-74). In Sheko (Omotic, Afroasiatic) the interrogative 
yira ‘what’ can take a “motive” marker; the resulting form yír-éfrità has acquired the 
meaning ‘why’ (Hellenthal 2010: 411-412). Useful but much less common alternatives for 
the designation of interrogatives are epistememes (Mushin 1995) or ignoratives (Miyaoka 
2012: 443-461), which both emphasize their relation to knowledge. 


Table 4.8: Examples for semantic connections between interrogatives and basic 
nouns etc.; see Chapter 5 for many examples 


Category English Basic Elements 

PERSON who man, person, one, DEM, CLF 
THING what thing 

SELECTION which (one) one, CLF 

KIND what kind of kind, sort, class 

ACTIVITY to do what to do, to make 

CAUSE why, what for cause, reason, DAT, CVB, PURP 
MANNER how way, fashion, manner 
QUANTITY, MASS how much much, few, ?amount 
QUANTITY, COUNT how many many, ?number 

PLACE, LOCATION where place, side, Loc 

PLACE, DIRECTION whither, where to direction, ALL 

PLACE, SOURCE whence, where from ?source, ABL 

TIME when time (+ Loc) 


Figure 4.3 is a slightly revised version of Cysouw’s (2005) illustration of major path- 
ways of the derivation of interrogatives, and may also be understood as a conceptual 
space for interrogatives (Hólzl 2015c). Similar to the conceptual space for question mark- 
ing in §4.2.2, this conceptual space of interrogatives allows a comparison of the semantic 
scope of individual interrogatives within one or across several languages. Connections 
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between categories indicate the possibility that they can be covered by the same interrog- 
ative. Arrows furthermore show common paths of developments, either merely semantic 
or by means of derivation and inflection. 


ACTIVITY REASON 
Ke p i 
THING KIND 
Em p 
PERSON MANNER ——————— — ——» TIME 
SELECTION QUANTITY 


Figure 4.3: A conceptual space of interrogatives 


Cross-linguistic data suggest that interrogatives meaning ‘what’ or ‘which’ are the 
unmarked and most basic members of the interrogative system and often serve as the 
basis for the derivation of other interrogatives. The grammaticalization of interrogatives 
to question markers and the use of interrogatives in open alternative questions offer 
additional evidence for this hypothesis; in both cases it is typically an unmarked inter- 
rogative with the meaning ‘what’ that is employed. The category PERSON occupies a 
special position as it appears to be less prone to changes and more stable diachronically. 

The conceptual space was in need of several slight revisions. For NEA the category 
of activity had to be added; it is integrated into the map with the following connec- 
tions: THING—ACTIVITY— REASON (Hólzl 2015c). An example is Manchu, which has an 
interrogative ai ‘what’. This interrogative may take a verbalizer -na- to yield ai-na- ‘to 
do what', which, in turn, may take the imperfective converb marker -me, resulting in the 
complex interrogative ai-na-me ‘why’ (literally ‘doing what’ or ‘in order to do what’). 
The category of KIND has also been tentatively added. For example, English what kind of 
suggests a connection THING—KIND (see also Idiatov 2007: 51ff.) and Mandarin zénme 
yang de MANNER—>KIND (zénme ‘how’, yang ‘kind, type’, de ‘ATTR’). It may be necessary 
to update further aspects of the conceptual space in future studies such as a possible 
connection SELECTION—>KIND, but for NEA the most important aspects are present. 

Apart from these categories, the space also lacks the categories of DIRECTION and 
SOURCE, that are clearly related to the category of PLAcE. Further categories such as 
translatives or prolatives will be ignored due to a lack of data for most languages in NEA. 
Figure 4.4 shows these three categories on a small conceptual space (Hólzl 2015c) that 
is already known from studies in case marking (e.g., Creissels 2006). Within Cysouw's 
conceptual space, the category of PLACE appears to cover not only LoCATION but the two 
categories of DIRECTION and sounck as well. For instance, Manchu absi ‘how’ derives 
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from a form meaning ‘whither’ and German woher ‘whence’ may also mean ‘how, why’ 
in certain contexts (e.g., woher denn? ‘why then?’). The conceptual space for locative 
interrogatives may be conceptualized as the result of zooming in on the category of 
PLACE. A close-up examination of QUANTITY reveals the limited conceptual space MAss— 
COUNT. 

Within the conceptual space for locative interrogatives, languages differ with respect 
to scope, markedness, whether they have case marking or special forms, and whether 
case markers are also found on nouns or not. English used to have the special forms 
whence and whither, but they have been replaced with the case marked forms where to 
and where from. Within the new system, where is unmarked for case. While in English 
to and from are usual case markers (or prepositions), German wo-hin and wo-her (de- 
rived from wo where") have special suffixes that may otherwise only be found in the 
demonstratives (and as verboids, see §5.5.3.2). English and German have three different 
forms, but Italian dove has scope over both Location (Dove sei? “Where are you?’) and 
DIRECTION (Dove vai? “Where are you going?’), while SOURCE is expressed with di/da 
dove (Di dove sei? "Where are you from?’, Da dove vieni? "Where are you coming from?"). 
A recent book on spatial interrogatives that appeared after finishing this book could 
unfortunately not be taken into account (Stolz et al. 2017). 


LOCATION 


DIRECTION SOURCE 


Figure 4.4: A simplified conceptual space for subcategories of PLACE 


4.3.2 Word class membership of interrogatives 


Typical word class membership of interrogatives is relatively straightforward (Table 4.9), 
although there is some cross-linguistic variation. As mentioned before, interrogatives be- 
long to a lot of different word classes. There are several clues for determining the word 
class of a certain interrogative such as inflectional properties or open derivations. For 
instance, interrogative verbs in many languages are either combinations of the interrog- 
ative ‘what’ with a plain verb such as ‘to do’ (English to do what) or contain a verbalizing 
element (Manchu ai-na- ‘what-v-’). To take another example, causal interrogatives are 
often verbs with a converb marker (Even ja-mi ‘why’) or nouns with a case marker such 
as the dative (Buryat yüün-de why"). Nevertheless, converb and case markers are often 
related with each other diachronically and fulfill similar adverbial functions. See Chap- 
ter 5 for many more examples. 
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Paradigms in the Australian language Djabugay (Pama-Nyungan), to give but one ad- 
ditional example, show an interesting split between a pronominal accusative pattern on 
the one hand (PERson) and a nominal ergative marking on the other (THING) (Table 4.10). 


Table 4.9: Typical word class membership of different interrogatives 


Category English Typical word class 
PERSON who pronoun, noun 
THING what noun, pronoun 
SELECTION which (one) adjective, (pro)noun 
KIND what kind of adjective 

ACTIVITY to do what verb 

CAUSE why, what for adverb 

MANNER how adverb 

QUANTITY how many/much adjective, numeral 
PLACE where, whither, whence adverb 

TIME whither, where to adverb 

DIRECTION whence, where from adverb 

SOURCE when adverb 


Table 4.10: Inflection of Djabugay (Pama-Nyungan) interrogatives (Nau 1999: 


135) 
PERSON THING 
S dju: nyirrangu 
A dju: nyi: 
O dju:ny nyi: 


4.3.3 The diachrony of interrogatives 


The diachrony of interrogatives can be described with a limited set of developmental 
paths summarized in Table 4.11. (A) Interrogatives may simply be too old to be analyz- 
able at all. To repeat the example from Chapter 1, English where or German wo(r-) go 
back directly to Proto-Indo-European "Eër. Apart from phonological changes, the form 
has been preserved over the course of several millennia. A special subtype of this is the 
loss of the resonance, i.e. the existence of the same initial sounds in several interrogatives 
(Bickel & Nichols 2007; Mackenzie 2009, Chapter 1). Such a resonance is usually the sign 
of an old etymological connection between the participating interrogatives. Given the 
predominance of suffixes over prefixes (e.g., Manchu ai-de ‘what-pat’) and the dominant 


85 


4 The typology of questions 


word order IntN (e.g., Manchu ai-ba- ‘what-place-‘) etc., this feature might be especially 
pronounced in NEA. Phonological changes, such as the bonding and fusion of such an- 
alyzable forms, lead to the emergence of resonances. In most Tungusic languages, an 
original resonance that is preserved in some languages such as Nanai x~, was lost com- 
pletely (e.g., Nanai xar vs. Manchu ai ‘what'). Such a development is unique as it affects 
all interrogatives that share the changing phonological feature. All other changes affect 
only one or two interrogatives at once. 


Table 4.11: The diachrony of interrogatives excluding developments from inter- 
rogatives to other domains (Hólzl 2015c); PT = Proto-Tungusic 


Schematic Details Example 
A INT, » INTi phonological PIE *k"ód > OE hweet > NE what 
changes 
B INT; > INT2 semantic changes Wutun age ‘which (one) > who’ 
C INTi-Xcram > INT; inflection English where to 
(> fusion) 
D INT;i-Xizx > INT? derivation, English how much 
reinforcement 
(> fusion) 
E  (INTj)-Xigx > INT; replacement Italian che > che cosa > cosa ‘what’ 
P INT, INT; > INT; convergence PT *ja, "Kai > Kh. Evenki i(i)- 
G Zug > INT ?grammaticaliza- ?Evenki anii ‘thing > INT’ 
tion 


(B) There may be semantic changes that leave the formal side more or less intact or 
are at least not directly connected with it. One such change is the development from the 
meaning ‘which one’ to ‘who’ as it can be found in several languages in NEA such as the 
Sinitic language Wutun (see also Idiatov 2007). Both demonstratives and interrogatives 
are frequently reinforced with the help of other elements, (C) grammatical (e.g., Manchu 
ai-de ‘what-Loc > where, why, how’) or (D) lexical (Manchu ai-ba-(de) ‘what-place-(Loc) 
> where’). Over the course of time these two elements normally fuse into one form. Pos- 
sible developments of these last three types can also be found on Cysouw’s (2005) con- 
ceptual space (Figure 4.4). (E) In some instances, however, the original interrogative may 
be dropped such as in Italian (che) cosa ‘thing > what’. This is somewhat reminiscent of 
one of the well-known Jespersen cycles for negation such as the gradual replacement 
of ne by pas in French. (F) Convergence is very rare and within NEA seems to be re- 
stricted to Tungusic languages. In some languages such as Khamnigan Evenki, perhaps 
due to phonological changes, two different interrogative stems merged into one form. 
This might be treated as a subtype of change (A) but has an impact on both the form an 
function of several interrogatives. (G) Whether lexical items can directly develop into 
interrogatives as argued by Schulze (2007), for instance, is highly disputed. Most schol- 
ars deny this possibility altogether (e.g., Diessel 2003; Cysouw & Hackstein 2011) and I 
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tend to agree. There may be some valid examples, such as in Evenki (see §5.10.3), but this 
certainly is much less common than developments (C), (D), and even (E). 

Most of these changes have been taken into account by Muysken & Smith (1990), who 
developed one of the best typologies of interrogative systems (Table 4.12). 


Table 4.12: The typology of interrogatives according to Muysken & Smith (1990) 


Chinese Pidgin English Sranan Jamaican Latin KiNubi 
who who(-man) (o)s(u)ma ` huu-dat quis munü 
what wat ting (o)san wa(t)/we/wara quid s(h)unú 
when wat-time o-ten wen(-taym) quando mitéén 
where wat-side (o)pe we(-paat) cur wén 
type transparent atrophied mixed transparent fused opaque 

simple > — — complex 


Muysken & Smith (1990) differentiated five different types of interrogative systems. 
Analyzable combinations of interrogatives with other elements are called transparent. 
The fusion of such analyzable forms leads to fused systems such as in Latin, which in 
most forms are still related but synchronically not analyzable. The system in KiNubi 
does not even exhibit such a relic and can be called opaque, as the interrogatives are 
synchronically non-analyzable. Jamaican Creole has both analyzable forms such as huu- 
dat (« English who-that) or we(-paat) (« English where-part), and non-analyzable forms 
such as wa(t) (« English what) and therefore can be called mixed-transparent. Quite rare 
are atrophied interrogative systems that used to be transparent but subsequently lost the 
actual interrogative marker, as in Italian (che) cosa. The analyzability of forms, of course, 
does tend to decrease over the course of time, unless new forms are built. But there may 
also be a development in the opposite direction, as witnessed in the reanalysis of wor.um 
‘around where, about what’ as wo.rum in German which allows a connection to the word 
wo ‘where’ that historically lost the final -r (PIE *k"ór). 

Under extreme contact situations an interrogative system may be disturbed or inno- 
vated. Bickerton (2016 [1981]: 65-66) and Muysken & Smith (1990) claim that creole and 
pidgin languages tend to have transparent interrogative systems. Chapter 1 has argued 
that this phenomenon might not be restricted to creoles, but could be a more general ten- 
dency of simplification due to non-native L2 acquisition of a given language (McWhorter 
2007; Trudgill 2011; Operstein 2015). Simplification in this case means the reduction in the 
number of actual interrogatives, the “regularization of irregularities”, and the “increase 
in morphological transparency” (Trudgill 2011: 62). For this reason Table 4.12 contains 
a rough scale of complexity. In most cases, innovative interrogative systems are based 
on an interrogative meaning ‘what’ or ‘which’. An exception to this rule is the language 
Pichis Ashéninca as described by Cysouw (2007), in which this function is fulfilled by 
an interrogative meaning ‘where’ (see also §5.5.3.2 on German). 
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4.3.4 Inflectional properties of interrogatives 


The inflectional properties of interrogatives are often quite complex and can only be 
briefly sketched here (see Mushin 1995; Nau 1999; Siemund 2001: 1020-1023 among oth- 
ers). Chapter 5 gives a great many examples of inflected interrogatives. 

For the inflection of interrogatives all kinds of morphological types and means are 
attested cross-linguistically. In Anong (Trans-Himalayan), for instance, the plural of 
Halte? ‘who (sch is formed by reduplication: Kail io? Hall"? ‘who (PLY (Sun Hong- 
kai et al. 2009: 74). As seen in 84.3.3, inflected interrogatives often grammaticalize into 
interrogatives with a different meaning. Locative interrogatives in Anong, for example, 
exhibit a locative marker that is analyzed as suffix here, k"a°!-a® ‘which-Loc’ (Sun Hong- 
kai et al. 2009: 73ff.). 


Table 4.13: The inflection of interrogatives in Pite Saami (Uralic; Wilbur 2014: 


120-121) 
who what 
SG PL SG PL 
NOM ge ge mij ma(-h) 
GEN ge-n ge-j ma-n me-j 
ACC ge-v ge-jd ma-v me-jd ~ ma-jd 
ILL ge-sa ge-jda ma-sa me-jda 
INESS ge-nne  ge-jdne ma-nne  ma-jdne 
ELAT ge-sste  ge-jsste ma-sste  ma-jsste 
COM ge-na ge ma-jna  me-j 


Inflection encompasses verbal (e.g., tense, aspect), nominal (e.g., person, number, gen- 
der), and other categories. The inflection of individual interrogatives usually depends on 
the word class (84.3.2) and often only a subset of the interrogatives takes inflection. In 
German, for example, wer ‘who’, but not was ‘what’, can take morphological case mark- 
ing. Only the interrogative wie viel- 'how many' can take the ordinal suffix -te that is 
specific to numerals, e.g. der wie-viel-te "the how manieth". The most important inflec- 
tional categories for NEA are perhaps number and case that are often organized into 
paradigms as in Table 4.13. 

Interrogatives may express additional nominal categories such as gender (e.g., Ice- 
landic hver ‘who.sc.M, who.sc.F’ or hvað ‘who.sc.n’, Siemund 2001: 1021), but this plays 
no important role for most of NEA. 

Inflectional properties of interrogatives can often be related to (pro)nouns or verbs, 
but not necessarily so. Often there is an overlap with the inflection of demonstratives. 
Consider the paradigms of nouns, demonstratives, and interrogatives of in Pite Saami 
(Uralic) given in Table 4.14. 

In this language there is a strong overlap of the three different paradigms, which nev- 
ertheless all have their special properties. Overall the paradigms of the demonstratives 
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Table 4.14: The inflection of nouns, demonstratives, and interrogatives (PERSON, 
THING) in Pite Saami, excluding abessive and essive markers for nouns (Wilbur 
2014: 93, 116, 120-121) 


N DEM INT 

SG PL SG PL SG PL 
NOM - (-h) -t (-h) - (mij) Ch 
GEN (-h) -j -n -j -n -j 
ACC -v -jt -v -jt -v -jd 
ILL -j -jda -sa -jda -sa -jda 
INESS -n -jn -n -jtne -nne -jdne 
ELAT -st -jst -sste -jste -sste -jsste 
COM -jn(a) -jn -jna -j -jna -j 


and interrogatives are particularly similar to each other (e.g., GEN.sc -n instead of -h). 


4.3.5 Interrogatives and demonstratives 


Of the connections to other categories, it is especially demonstratives that will play an 
important role within this study (§4.3.4, Chapter 5). In fact, many of the typological 
dimensions mentioned above, such as the diachronic developments, seem to hold for 
both categories. A connection between the two has often been noted (e.g., Dixon 2012), 
but the best analysis of this relation has been given by Diessel (2003). Consider some 
examples from the Munda (Austroasiatic) language Kharia spoken in eastern and cen- 
tral India (Peterson 2011: 178-179, 183-184). Demonstratives and interrogatives have par- 
allels both in inflection (e.g. a=te ^which-oBr,, u-te ‘this=oBL’), and derivation (e.g., a-tij 
‘which=side’, u=ti’?j ‘this side’). Languages differ from each other in how strongly devel- 
oped they are and how many interrogatives and demonstratives take part in the parallel 
development. Kharia, for example, has yet another interrogative (e.g., i=te *what-oBr) 
as well as two (and formerly three) additional demonstratives (e.g., ho=te ‘that.MED=OBL’, 
han-te ~ hin-te 'that-oBr"), not counting a loan from a neighboring language. Diessel 
(2003: 635) has shown that demonstratives, like interrogatives, “cross-cut the boundaries 
of several word classes", express basic semantic categories (e.g., Kharia tij ‘side’ etc.), 


have etymologically non-analyzable stems, are not derived from but reinforced by lex- 
ical items, and share a similar pragmatic function (Diessel 2003; 2006): “both types of 
expressions are commonly used as directives that instruct the hearer to search for a spe- 
cific piece of information outside of discourse (i.e. in the surrounding situation or in the 
hearer’s knowledge store)? (Diessel 2003: 636, my boldface) One difference between the 
two elements seems to be the fact that, while demonstratives are usually accompanied 
by a pointing gesture (Diessel 2006), this does not appear to be the case for most inter- 
rogatives. Although there are deictic interrogatives, they have a schematic meaning that 
contradicts a specific pointing gesture. In German discourse, however, in some cases a 
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selective interrogative can be accompanied with a pointing gesture, but this usually goes 
along with looking at the addressee and furling one’s eyebrows or similar indicators of 
doubt. Whether there are more specific connections between interrogatives and gestures 
remains to be investigated. 


4.4 Towards an ecological theory of questions 


One of the questions formulated in the Introduction (Chapter 1) concerned the actual 
meaning of questions (Sanitt 2011: 561). Inspired by Schulze (2007) and van der Auwera 
& Nuyts (2007), this section thus goes beyond traditional typology and explicitly tries to 
add several theoretical assumptions concerning the meaning of questions and sketches 
what might be called an ecological theory of questions. 

As noted in the Introduction, the fundamental unit for an ecological theory of lan- 
guage forms the so-called organism-environment system (OES, Jarvilehto 1998). Many 
cognitive approaches overemphasize the importance of the organism and especially the 
brain. As Ulric Neisser—the so-called father of Cognitive Psychology—said in an inter- 
view in 1997, his 1976 book “Cognition and Reality was partly an attempt to recall my 
information processing colleagues to reality, saying that there is a whole world out 
there to look at? (Szokolszky 2013: 187) However, Neisser also correctly pointed out 
that traditional Ecological Psychology (e.g., Gibson 1979) overemphasized the environ- 
mental aspect, but neglected memory and conceptualization. The theory of the organism- 
environment system, in my opinion, should aim at integrating aspects of both fields. The 
OES exists on several different time scales or causal frames (Enfield 2014) and contains 
language as an integral component (e.g., Odling-Smee & Laland 2009; Sinha 2013). How- 
ever, in the remainder of this section a focus will lie on the understudied microgenetic 
frame. Some results from the diachronic and synchronic perspectives will be taken as 
hints of the basic infrastructure of this frame. This should not lead to the misunderstand- 
ing, however, that basic elements of the human interaction engine (Levinson 2006) or 
the economics of questions (Levinson 2012a), most of which are located on the enchronic 
frame and in the sociocultural ecology, are unimportant. This section merely focuses on 
some of the less well understood aspects of questions and emphasizes the microgenetic 
frame and the cognitive ecology of language (Steffensen & Fill 2014: 7). Graesser (1985: 3) 
was probably right that “a theory of questioning is a special case of a more general the- 
ory of conversation”, which is why only some aspects can be addressed here. Given the 
brackground of this book, this section is written from a linguistic perspective, although 
insights from other disciplines are consulted whenever feasible (cf. Dillon 1982). 

Despite its ecological background, the general outline of the theory advocated here 
nevertheless is strongly based on the newly emerging simulation semantics paradigm 
that places a focus on the brain, but can easily be reconciliated with ecological ideas. The 
fundamental concept of this theory is so-called embodied simulation, which has been de- 
fined as “the re-enactment of perceptual, motor and introspective states acquired during 
experience with the world, body and mind” by Barsalou (2009: 1281) or as “the creation 
of mental experiences of perception and action in the absence of their external mani- 
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festation” by Bergen (2012: 14). These two definitions are more or less congruent and 
highlight different aspects of one and the same phenomenon. A definition offered by 
Gallese (2009: 527) in addition emphasizes the social aspect of simulations: 


By means of embodied simulation we do not just “see” an action, an emotion, or a 
sensation. Side by side with the sensory description of the observed social stimuli, 
internal representations of the body states associated with these actions, emotions, 
and sensations are evoked in the observer, “as if” he or she were doing a similar 
action or experiencing a similar emotion or sensation. That enables our social iden- 
tification with others. 


Given its neurological background, the theory may be misunderstood as focusing on 
the brain, exclusively. However, Barsalou (2009) has emphasized that simulations are 
always situated and multi-modal, which is in accordance with the theory of the OES. The 
theory is broad enough to bring together conception, perception, and action (and thus 
the organism and the environment) into one coherent theory. According to Barsalou 
(2009: 1281) 


the re-enactment process has two phases: (i) storage in long-term memory of multi- 
modal states that arise across the brain’s systems for perception, action and intro- 
spection (where ‘introspection’ refers to internal states that include affect, motiva- 
tion, intentions, metacognition, etc.), and (ii) partial re-enactment of these multi- 
modal states for later representational use, including prediction. 


Thus, simulations are never complete re-enactments but are attenuated to different de- 
grees (Langacker 2008: 536-537). 

It is especially the last aspect of a prediction or an anticipation (Jarvilehto 2009) that 
plays a crucial role for a theory of questions. Every question (rhetorical questions etc. 
aside) contain aspects that are not actually known by the speaker but merely predicted 
or anticipated to play a role within a certain context. Assuming the hearer is cooperative 
(Tomasello 2014b), the question may be answered or responded to in an expected way, 
if the anticipation turns out to be appropriate. For example, one of two specified alter- 
natives of an alternative question (45a) may be chosen as adequate and thus (partly) re- 
peated by the hearer (45a). If, however, the anticipation was inadequate, then the hearer 
will most likely point this out and give the appropriate alternative (45c) or try to find 
out what the misunderstanding is about (45d). 


(45) English 
a. When are you leaving, tomorrow or the day after tomorrow? 


b. (I’m leaving) tomorrow. 


e 


I'm not leaving, it is Bill who is leaving. 


d. I'm not leaving at all, what are you talking about? 


This is traditionally known as presupposition of a question. The background of these 
predictions has been called the pattern completion inference mechanism. 
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On encountering a familiar situation, an entrenched situated conceptualization 
for the situation becomes active. Typically, though only part of the situation is 
perceived initially. A relevant person, setting, event or introspection may be per- 
ceived, which then predicts that a particular situation—represented by a situated 
conceptualization—is about to unfold. By running the situated conceptualization as 
a simulation, the perceiver anticipates what will happen next, thereby performing 
effectively in the situation. The agent draws inferences from the simulation that go 
beyond the information given (Barsalou 2009: 1284) 


Polar, focus, and alternative questions all rely on this anticipatory mechanism. The differ- 
ence among them has to do with the fact that predictions may be more or less plausible, 
with the consequence that the information given may lead to one or more possible out- 
comes. In addition, the uncertainty may only concern a certain subpart of the entire 
simulation. This is one aspect of what is usually referred to as construal (e.g., Langacker 
2008), the ability to “construe the ‘same’ situation quite differently” (Ross 2014 [1987]: 
127). Content questions lack any specific predictions but still involve inferences in the 
sense that they rely on the activation of entrenched situated conceptualization. Consider 
the example of a broken window. We know from our previous experience that windows 
usually don’t break on their own and that somebody or something must have caused 
the glass to break. Most likely we would assume that there must be an agent responsible 
for breaking the window (e.g., one of the children usually playing soccer in front of the 
house), leading to the question Who broke the window? In case we have encountered a 
similar situation before and know the identity of a potential agent, we may also ask some- 
thing like Did Tom break the window again? Questions are an expression of the human 
imaginative capacity and thus, ironically, of knowledge, memory, and experience. 

Tomasello (2008: 84-87) differentiates between three basic communicative motives, i.e. 
requesting, informing, and sharing. Arguably, questions can be used for all three motives. 
Consider the constructed examples in (46). 


(46) English 
a. Could you open the window? 


b. Did you know Sarah is pregnant? 
c. That's beautiful, isn't it? 


Given the overall focus of this study, however, only prototypical questions can be covered 
here, i.e. actual requests for information (e.g., Levinson 20122), which is a special case of 
the first motive. However, as we have just seen, every question itself necessarily contains 
some amount of information. 


Have you ever hesitated to ask a question? Perhaps you feared it might be foolish. 
Or it might be too near the bone, too probing. Perhaps it might cause offence. Or 
it might distract us from the business at hand and lead to other things. Or it might 
open you up to the reciprocal question, which you would not want to answer. In- 
trospection suggests a plethora of reasons for suppressing questions that might 
arise in one's mind. (Levinson 2012a: 19) 
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In a certain sense, questions are an example of the perception-action cycle as postulated 
in Ecological Psychology: “animals [including humans] move so that they can perceive, 
and perceive so that they can move” (Swenson & Turvey 1991: 319, my brackets). Ques- 
tions give information in order to obtain additional information necessary for a certain 
purpose. Nevertheless, prototypical answers are a better example of the second commu- 
nicative motive. Interestingly, requesting appears to precede informing both phylogenet- 
ically and ontogenetically (Tomasello 2008: 137, 247) and thus clearly plays a fundamental 
role for human beings. The third motive is irrelevant for the purpose of this study. 

Prototypical questions may furthermore be characterized as a form of exploratory be- 
havior that results from curiosity. According to Loewenstein (1994: 87), curiosity in the 
sense of “an intrinsically motivated desire for specific information” is raised by the fo- 
cusing of a gap in our knowledge base. Such “an information gap is characterized by two 
quantities: what one knows and what one wants to know.’ All question types may be 
characterized in the same terms. In content questions the entrenched situated concep- 
tualization equips us with a schematic knowledge but inquires about a specific piece of 
information one wants to know. In the case of who, we know about an agent but want 
to know its identity. In polar and focus questions we have a specific assumption but do 
not know whether this is accurate. In alternative questions we can imagine two or more 
possibilities but do not know which one is the most accurate. The underlying pattern 
can be called a hierarchy of specificity of question types (47, cf. Levinson 2012a: 23; Hólzl 
2016a). 


(47) CQ<PQ<FQ<AQ 


The term specificity, which contrasts with schematicity, has been adopted from Langacker 
(2008: 19); see also Arnheim (1969: 238). Focus questions are more specific than polar 
questions, because the uncertainty just concerns the focused subpart which is much 
more specific than in content questions. Alternative questions appear to be the most 
specific, because they openly specify all plausible alternatives. The possible negative 
answer in polar and focus questions opens up a plentitude of alternatives. There is direct 
evidence for this hierarchy. One pattern recurring in many languages is a combination 
of a content question followed by a polar, focus, or alternative question that elaborates 
on the frame set by the content question (e.g., What do you want, coffee or tea?). Consider 
the following examples from Northeast Asia (48-50) and beyond (51-53). 


(48) Evenki (Tungusic) 
si i:-le pene-d'e-nni, [d’u-la-vi=gu, tatkit-tula=gu]? 
2sG which-ALL go-prs-2SG home-ALL-REFL.POSS=Q school-ALL=Q 


"Where are you going, [home or to school]?' (Nedjalkov 1997: 7) 


(49) Khorchin Mongolian (Mongolic) 
Ci jaa.x-sa=ji, [tolgo-cin* ubud-jaeze-n-oo]? 
2sc do.what-P.Prv-Q head-2sco.Poss hurt-PROG-PRS=Q 
"What's up, [is your head aching]?' (Yamakoshi 2015: 296) 
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(50) Udihe (Tungusic) 
i:-le una-za-i [amá:-za-la-nu zulie-ze-le-nu]? 
which-roc travel-suBJ-2sc back-N-LOC=Q front-N-Loc-Q 
"Where will you travel, [in the front or in the back]?' (Nikolaeva & Tolskaya 2001: 
812) 


(51) Teiwa (Timor-Alor-Pantar) 
amidan la qau, [tii le karian]? 
what Foc good sleep or work 


"What is better, [sleeping or working]?' (Klamer 2010: 284) 


(52) Amis (Austronesian) 
Cima Eu ta.tayra namu i taypak, [ci tunici aki]? 
who NOM go 2PL.GEN PREP PN NOM PN NOM PN 


‘Which one of you is going to Taipei, [Lungi or Aki]?’ (Huang et al. 1999: 650) 


(53) Central Alaskan Yupik (Eskaleut) 
nali-ak assik-siu, [kuuvviaq  wall’u saayuq]? 
which-Ass.3Du.sc like-2sG.38G.Q coffee.ABS.SG or tea.ABS.SG 


‘Which do you want, [coffee or tea]?’ (Miyaoka 2012: 170) 


It is difficult to determine whether this is a universal pattern, because grammar books 
never explicitly address it as a phenomenon on its own right. Nevertheless, the fact that it 
can be found in languages from around the world indicates that it is a strong tendency at 
the very least. Future studies have to determine the exact meaning of this pattern, which 
may differ from instance to instance and from language to language. Additional examples 
from Chalkan, Chuvash, Udihe, Uilta, Uzbek, Kalmyk, and Ket can be found throughout 
Chapter 5. See also §6.3 for examples from the Timor-Alor-Pantar language Abui and 
the Austronesian language Balantak. In general terms the pattern can be described as the 
iconic linguistic expression of a possible universal that starts with the schematic and, by 
means of exploration and anticipation, gradually arrives at the more specific (e.g., Bar 
2009; Barsalou 2009). The same phenomenon can be observed in focus questions with a 
focus on generic nouns that are more specific than interrogatives, but are followed by 
a question with an even more specific or proper noun (e.g., Do you want tea, Earl Grey 
or Pu-Erh perhaps?). In both cases the crucial point is that the first question is located 
lower on the scale of specificity in (47) than the second. In a way, epistemic tag questions 
mirror this structure because they start from a rather general statement and arrive at 
the specific question of whether this statement is appropriate (e.g., You want tea, right?). 
A major difference, however, is the fact that the first element in a tag question is not a 
question itself or at least is not overtly marked as such. Another difference is the scope 
of the second question over the whole proposition in the case of many tag questions. 
Alternative questions exhibit some affinity to this pattern as well, but there are major 
differences. While in all examples above the second sentences elaborate on, or are based 
on, the first one, alternative questions have mutually exclusive alternatives. A similarity 
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with alternative questions is, however, the fact that in both cases there is the possibility 
of ellipsis (cf. Do you want tea or (do you want) coffee? and What do you want, (do you 
want) coffee or tea?). Nevertheless, alternative questions are better treated as a question 
type comparable to polar and content questions as there are further differences, such as 
the connection of alternative questions with the domain of coordination. Tag questions, 
which are much more similar to this pattern than alternative questions, of course, do 
not repeat the same statement. Instead, question tags may anaphorically refer to the 
statement (e.g., isn’t it?). 

To borrow a term from Langacker (2008) again, it may be claimed that these combi- 
nations of questions follow a natural and dynamic path of mental access that unfolds 
through time, from the schematic to the specific: 


Between the moment the organism is confronted with the problem and the moment 
the final solution is achieved there occur, as a rule, a number of intermediate steps 
leading, in an hierarchical fashion, from general to more specific features of the 
sought-after solution. (Duncker & Krechevsky 1939: 178, emphasis modified) 


In Langacker’s (2008: 83) terminology, this can also be called a reference point rela- 
tionship, in which the second part (the target) is mentally located with respect to the 
first (the reference point). Dewey’s (1910: 102) description of the phenomenon is still sur- 
prisingly accurate. He differentiates between three different situations, the first two of 
which define the extremes, i.e. absolute certainty and uncertainty: 


Unless there is something doubtful, the situation is read off at a glance; it is taken in 
on sight, ie. there is merely apprehension, perception, recognition, not judgment. 
If the matter is wholly doubtful, if it is dark and obscure throughout, there is a 
blind mystery and again no judgment occurs. 


The third situation exactly corresponds to the scale of uncertainty in between these ex- 
tremes: 


But if it suggests, however vaguely, different meanings, rival possible interpreta- 
tions, there is some point at issue, some matter at stake. Doubt takes the form of 
dispute, controversy; different sides compete for a conclusion in their favor. Cases 
brought to trial before a judge illustrate neatly and unambiguously this strife of 
alternative interpretations; but any case of trying to clear up intellectually a doubt- 
ful situation exemplifies the same traits. A moving blur catches our eye in the dis- 
tance; we ask ourselves: “What is it? Is it a cloud of whirling dust? a tree waving its 
branches? a man signaling to us?” Something in the total situation suggests each 
of these possible meanings. Only one of them can possibly be sound; perhaps none 
of them is appropriate; yet some meaning the thing in question surely has. 


Not only this combination of questions, but questions in general can be characterized as 
an expression of uncertainty (e.g., Schulze 2007). However, uncertainty is merely one of 
several collative variables, a term coined by Berlyne (1960: 44). 
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For want of a more satisfactory term, we shall call them collative variables since, in 
order to evaluate them, it is necessary to examine the similarities and differences, 
compatibilities and incompatibilities between elements—between a present stim- 
ulus and stimuli that have been experienced previously (novelty and change), be- 
tween one element of a pattern and other elements that accompany it (complexity), 
between simultaneously aroused responses (conflict), between stimuli and expec- 
tations (surprisingness), or between simultaneously aroused expectations (uncer- 
tainty). 


Given the ecological background of this study, the terms stimulus and response have to 
be treated with caution. Instead of passively reacting to the environment, the organism 
itself may engage in active exploratory behavior (e.g., Dewey 1896; 1910: 193; Gibson 
1960; 1979: 55ff.; Gibson 1988: 5-6). Conceptually, this is a similar distinction as that be- 
tween natural selection by the environment and niche construction by the organism that 
we have seen in the Introduction (Odling-Smee & Laland 2009). In many cases, it is the 
actions and the movements of the organism itself that lead to the pick-up of novel, chang- 
ing, complex, conflicting, surprising, or uncertain information. Baranesa et al. (2015: 89) 
argue “that curiosity can be viewed as a pro-active process that anticipates, or moti- 
vates agents to obtain new information, whereas surprise indicates a reactive process 
after having processed the information? (my boldface) This in turn results in further 
exploratory behavior. 

Of course, there is also the artificial arousal of curiosity such as, for instance, in a 
riddle, which “compares an object to another entirely different object. Its essence consists 
in the surprise that the solution occasions". Eventually, "the hearer perceives that he has 
entirely misunderstood what has been said to him.” (Taylor 1943: 129) The riddle arouses 
curiosity in the addressee by means of collative information and initiates the search for 
the solution. Take an example from the Tungusic language Uilta, which starts with the 
introduction gar gar) gajagoo! and goes on as follows: boo toptongoor, naa toptongoor, xai- 
gaak? toksiik unuu! ‘In heaven there are spots, on earth there are spots, what are they? 
Riddle me!’ The riddle has several possible answers such as boo unigarinnii sundatta 
xasiktannii "Ihe stars in heaven and the scales of fish’ The answer is followed by the 
reply toksiik ‘Correct. (Ikegami 1958: 93), which puts an end to curiosity. 

A basic typology of different kinds of curiosity was also sketched by Berlyne (1954) 
who differentiates between two dimensions that define four types of curiosity (see also 
Dewey 1910: 30ff.). These have been concisely summarized by Loewenstein (1994: 77) as 
follows. 


Perceptual curiosity referred to “a drive which is aroused by novel stimuli and 
reduced by continued exposure to these stimuli" [Berlyne 1954: 180]. Epistemic 
curiosity referred to a desire for knowledge and applied mainly to humans. Specific 
curiosity referred to the desire for a particular piece of information, as epitomized 
by the attempt to solve a puzzle. Finally, diversive curiosity referred to a more 
general seeking of stimulation that is closely related to boredom. In the four-way 
categorization produced by these two dimensions, specific perceptual curiosity is 
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exemplified by a monkey’s effort to solve a puzzle, diversive perceptual curiosity 
is exemplified by a rat’s exploration of a maze [...], specific epistemic curiosity 
is exemplified by a scientist’s search for the solution to a problem, and diversive 
epistemic curiosity is exemplified by a bored teenager’s flipping among television 
channels. (my boldface and square brackets) 


It is especially specific epistemic curiosity that plays a crucial role for the characterization 
of questions. Above, we have already encountered the knowledge gap theory of curios- 
ity by Loewenstein (1994), which is strongly based on Gestalt Psychology: “If curiosity 
is like a hunger for knowledge, then a small ‘priming dose’ of information increases 
the hunger, and the decrease in curiosity from knowing a lot is like being satiated by 
information.’ (Kang et al. 2009: 963) The first to sketch a gestalt approach to curiosity 
was also Berlyne (1954: 181), proposing “a drive to fill in such gaps in the subject’s ex- 
perienced representations”. This is based on the well-known gestalt principle of closure. 
Fritz Perls (1973: 119)—the father of Gestalt Therapy—put it this way: “The gestalt wants 
to be completed. If the gestalt is not completed, we are left with unfinished situations, 
and these unfinished situations press and press and press and want to be completed.” Ina 
different terminology one could say that an embodied simulation wants to be completed. 
For instance, unanswered questions usually lead to an “increased effort in constructing 
a coherent representation” (Hoeks et al. 2013: 8). If there is insufficient information to 
complete a simulation, curiosity and exploration set in. What exactly the evolutionary 
origins of curiosity are is another matter that cannot be addressed here. The point is that 
curiosity is a psychologically real phenomenon and has to be taken into account for a 
characterization of questions. Gibson’s (1979: 219) statement that “[t]he visual system 
hunts for comprehension and clarity” can perhaps be generalized to the entire organism- 
environment system. Humans seek comprehension and clarity, and questions are one 
way of achieving this. Berlyne’s (1954: 182) description is based on a somewhat outdated 
terminology but nevertheless remains basically valid: 


When a question is put, whether by the subject himself or by somebody else, and 
the answer is already known, the appropriate response is made as a reaction condi- 
tioned by previous learning to the stimulus-pattern, and this relieves the drive im- 
mediately, so that the subject can proceed to some other activity. However, when 
the answer is not known, the drive will persist, and some sort of trial-and-error 
process can be expected to follow as with any other drive-state. 


He mentions three different possibilities for this “trial-and-error process”, thinking, ob- 
servation, and recourse to authority. The first refers to processes mostly restricted to the 
organism such as problem solving or memory, but the latter two roughly correspond to 
the physical and social environment, respectively (see also Lewin 1936: 24ff.; Steffensen 
& Fill 2014: 7). 

Put differently, one may resolve curiosity in three different but interrelated ways. First, 
in most cases one's own experience and memory are sufficient, although in some cases 
additional thought processes such as problem solving may be necessary. This is the tra- 
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ditional realm of Cognitive Science. In his Natural history of human thinking, Tomasello 
(2014a) defines thinking as 


a single cognitive process, but one that involves several key components, especially 
(1) the ability to cognitively represent experiences to oneself “offline”; (2) the ability 
to simulate or make inferences transforming these representations causally, inten- 
tionally, and/or logically; and (3) the ability to self-monitor and evaluate how these 
simulated experiences might lead to specific behavioral outcomes—and so to make 
a thoughtful behavioral decision. 


The fundamental mechanism of representation assumed in this study is embodied sim- 
ulation as defined above. Perhaps most instances of curiosity are simply resolved by 
inferences and predictions and their subsequent evaluation whether they are plausible 
or not. But it is wrong to assume, as Tomasello seems to be well aware, that simulations 
may be completely “offline” or detached from the environment. In fact, as Glenberg (1997: 
1) observed, simulations may be said to be basically “driven by the environment”: 


A significant human skill is learning how to suppress the overriding contribution 
of the environment to conceptualization, thereby allowing memory to guide con- 
ceptualization. The effort used in suppressing input from the environment pays off 
by allowing prediction, recollective memory, and language comprehension. 


The pay-off is a plausible evolutionary explanation to pay less attention to a potentially 
dangerous environment. But Glenberg’s (1997) inclusion of language comprehension is 
problematic, as language usually is an aspect of our social environment. Simulations 
based on language, for example when we listen to somebody asking us a question, are 
certainly driven by the (social) environment. 

Second, in some cases we may encounter situations or objects that we have not en- 
countered before or are otherwise unfamiliar with. In this case we may simply move 
around, explore, and change our relative perspective and distance in order to perceive 
previously inaccessible aspects. This is something Ecological Psychology has focused on 
from its very beginnings (see Gibson 1979). In this sense, curiosity is simply resolved by 
physically exploring and changing our position relative to the problematic object. Small 
objects, of course, may be grasped and turned in order to be investigated in a more thor- 
ough fashion. A different example of physical exploration based on diversive instead of 
specific curiosity can be illustrated by the wanderlust of the Tungusic speaking Evenki 
in Siberia. 


The Evenki learn from an early age to be interested in, rather than frightened by, 
risky situations and the possibility of exploring new territories. For them, seek- 
ing out new places offers a wonderful opportunity to experience companionship 
and, as a result, it is common to go somewhere just for the sake of exploration. 
(Safonova & Santha 2013: 142, my boldface) 


This is one of several reasons for the extraordinary wide distribution of Evenki and their 
close linguistic relatives such as the Even over all of the northern half of NEA (§2.10). 
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Third, instead of observing or thinking on our own, we may also see whether other 
people can help us clarify certain aspects of a problematic object or situation. This is 
mostly accomplished by means of language, of course, and the most prototypical tool 
for this are questions. Hodges (2009: 636), in analogy to Gibson’s (1979) ambient optic 
array, proposed the name dialogical array, 


a group of hearer-speakers surrounding a given speaker-hearer, listening and talk- 
ing in ways that reveal, inevitably, something of their perspectives, their intentions, 
and their histories relative to the present place and time. Like light, the ordered ges- 
tures of the array, as well as their disordering and reordering over time, allow a 
participant in the array to have their own orderings restructured on various scales. 
It is an array of partners, actual and potential, who provide information, not just 
about themselves as intentional agents and as objects, but about objects, events, 
and agencies beyond the physical and temporal horizons of the immediate physi- 
cal surround. (Hodges 2009: 636) 


The dialogical array affords (Lewin 1936; Gibson 1979) linguistic interaction such as ask- 
ing questions. From one point of view questions are a form of bodily action that bring 
about changes in the dialogical array, which in turn allows the pick-up of new informa- 
tion (cf. Swenson & Turvey 1991). This reliance on other people potentially brings with 
it the danger of deception and misinformation as well as of social costs (e.g., Levinson 
2012a: 20), but pays off by being faster and requiring less effort, especially if we are deal- 
ing with complex problems. This might be the reason why questions apparently are a 
universal property of language. While exploratory behavior can also be found in other 
animals, language in general and questions in particular crucially depend on the ultra- 
social nature of human beings who usually tend to cooperate with each other in ways 
that are unique (Tomasello 2014b). 

Of course, the above distinction is only a heuristic one. In principle, the three means 
of resolving curiosity are interrelated and often combined. They merely highlight dif- 
ferent aspects of the organism-environment system (Lewin 1936: 27; Steffensen & Fill 
2014: 7). Questions, for instance, necessarily contain aspects of all three types of explo- 
ration mentioned above. While the social dimension is the most important, both physical 
movements (e.g., eye contact) as well as thinking (e.g., predictions) are crucial elements 
as well. Questions trigger incomplete simulations in the hearer, based on her experience 
and memory, who then engages in exploratory behavior herself. Here basically the same 
three mechanisms come into play. Either the hearer has sufficient information to fill in 
the gaps herself, or she engages in other exploratory behavior (e.g., looking something 
up), or seeks additional help and asks the same question of somebody else who is likely 
to know the answer. 

The discussion thus far has overemphasized the microgenetic aspect of questions, but 
this last point has mentioned some enchronic properties as well. The social aspect of 
questions can be observed, for example, through a relatively strong obligation on the 
part of the addressee to respond (e.g., Levinson 2012a: 16). The interaction of questions 
with evidentiality offers additional insights into the social nature of interrogativity. In 
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many languages that have perspective marking, there is a shift to the perspective of the 
addressee in questions. Consider some examples from the Cha’palaa language spoken in 
Ecuador that has an egophoric system. 


(54) Cha’palaa (Barbacoan) 


a. (i-ya) pipe-yu. 
1-roP bathe-EGo 


‘I bathed? 

b. (nu/ya) pipe-we. 
2/3 bathe-N.EGO 
"You/(s)he bathed? 


c. (fiu-ya) pipe-yu? 
2-roc bathe-Eco 


‘Did you bathe?’ 


d. (ya-a) pipe-n? 
3-Foc bathe-N.EGO.Q 


‘Did (s)he bathe?’ (San Roque et al. 2017: 136-137) 


The egophoric marker -yu appears in both statements that refer to a first person (54a) and 
in questions that refer to a second person (54c). Tournadre & LaPolla (2014: 245) capture 
this phenomenon with the anticipation rule, which they illustrate with Tibetan: “when- 
ever the speaker asks a direct question of the hearer, she should anticipate the access/ 
source available to the hearer and select the evidential auxiliary/copula accordingly.” The 
underlying mechanism can be explained with the help of embodied simulation: The ques- 
tioner asks the question as if she was the addressee herself (Gallese 2009: 527), using 
predictions obtained through mentally simulating the situation. See §5.9.2.1 on Wutun 
and §5.9.2.2 on Amdo Tibetan for additional examples. 

According to Schulze (2007: 248), furthermore, there is a “strong coupling of the first 
person with assertions and of the second person with modal features, among them in- 
terrogativity" This important observation, it seems, can be directly observed in a num- 
ber of languages that exhibit a split type based on person. Qiang, for example, which 
we have encountered above, has a special question marker for second person singular. 
Some Turkic languages have a split system that is sensitive to second person, too (§5.11.2). 
Regarding West Greenlandic (Eskaleut), Sadock (1984: 199) observed that 


in all cases where the subject is second person, there is an interrogative form that 
is distinct from the indicative; in some cases where the subject is third person 
(nowadays only where there is no object, but formerly also where the object was 
third person), there are distinct interrogative and indicative forms; but in no case 
where the subject is first person is there a separate interrogative form. 
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Sadock proposes the hierarchy in (55). 
(55) 2>3>1 


Questions are most likely to refer to a second person and least likely to refer to a first 
person. This should not be interpreted as a strict implicational hierarchy, however, which 
allows no exceptions. Nevertheless, it seems to be a valid tendency that had actually 
already been discovered by Bolinger (1957: 3): “You occurs oftener than not in Qs. It 
therefore ‘means’ ‘question; loosely and insufficiently, but enough so that a locution not 
otherwise identifiable as a Q becomes one (is reacted to as one) if you is present.’ This 
highlights the social aspect of questions, which are strongly rooted in communicative 
interaction, and has an analogue in the gazing behavior of the questioner. Rossano et al. 
(2009: 239), based on the investigation of the three very different speech communities of 
Italian (Indo-European), Yéli Dnye (no affiliation), and Tenejapan Tzeltal (Mayan), found 
that it is especially the questioner who is gazing at the addressee (instead of the other 
way around), which is in accordance with my subjective impression for conversations 
in German. Recently, Baranesa et al. (2015: 81) additionally found “that higher curiosity 
was associated with earlier anticipatory orienting of gaze toward the [expected] answer 
location”. These facts are also consistent with the explanation of questions as a form 
of exploratory behavior in the dialogical array because most other types of exploration 
involve some kind of active looking. As Gibson (1979: 212) put it, “looking is always 
exploring”. 
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This chapter takes a closer look at the grammar of questions in language families of NEA 
in alphabetical order, from Ainuic (§5.1) to Yeniseic (§5.14). Each section on a language 
family is divided into three parts, a brief introduction that sketches its internal classifi- 
cation, a section on question marking, and one on interrogatives. For practical purposes, 
subsections in the larger language families Indo-European (§5.5) and Trans-Himalayan 
(§5.9) are distinguished additionally into subbranches such as Germanic or Sinitic. The 
part on Yeniseic has an additional subsection on the Dene-Yeniseian hypothesis (§5.13.4). 
Please note that, except for perhaps Tungusic, the classification of each language family 
is not exhaustive and is mostly intended as a tool that allows to better understand the 
internal order of the individual subsections. 


5.1 Ainuic 


5.1.1 Classification of Ainuic 


Ainuic has three dialect groups that are named after their geographical distribution. 
These are the Sakhalin dialects, Kuril dialects, and Hokkaido dialects. Excluding the pos- 
sible existence of now extinct Para-Ainuic varieties on Honshü, the Ainuic language 
family may roughly be classified as follows (cf. Vovin 1993: 157, see also Figure 5.1 in 
§5.1.3). 


— tSakhalin dialects 
IR —— ‘Kuril dialects 
—— Hokkaido-Kuril + 


L (DHokkaido dialects 


According to Shibatani (1990: 4) there is what he calls “Classical Ainu’, the language 
of oral epics (yukar), which differs from the spoken language and allegedly represents 
older stages of development. But Nakagawa & Okuda (2007: 378) claim that 


it is misleading to describe the grammar of Ainu as resting upon this distinction, 
because the behaviour and distribution of so-called “classical” features are actually 
independent from each other. There is no sound evidence to support the claim that 
the "classical" features are really older than *colloquial" ones in the history of this 
language. 
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The following description of question marking is mostly based on the Saru and Chi- 
tose dialects in southwestern Hokkaido as well as the Shizunai and Tokachi dialects in 
southeastern Hokkaido. The Sakhalin and Kuril dialects will only be mentioned briefly. 
A more complex picture including almost all dialects can be drawn for the interrogative 
system. 


5.1.2 Question marking in Ainuic 


For marking polar questions, the Saru dialect of Ainu has final rising intonation com- 
bined with an optional final particle ya. 


(2) Ainu (Saru) 
nisatta nupurpetor un e=arpa ya? 
tomorrow PN place ALL 2sG.S=go.PL Q 


"Will you go to Noboribetsu tomorrow?’ (Bugaeva 2012: 497) 


The online Topical Dictionary of Conversational Ainu based on the Saru dialect contains 
a section called Question and Answer from which the following example of an unmarked 
polar question with a longish and slightly rising intonation towards the end was drawn. 


(3) Ainu (Saru) 
ku=ye itak e=raman? 
1sc.A-say language 2sc.A=know 


‘Do you understand what I am saying?’ (NINJAL 2015) 


Interrogatives are in situ and there is usually no additional morphosyntactic marking, 
though the use of ya is possible. Alternative questions exhibit double marking with ya. 


(4) Ainu (Saru) 


a. hunna ek? 
who come 


‘Who came?’ 
b. hemanta e=e rusuy ya? 
what 2sc.S-coP want Q 


"What would you like to eat?' (Tamura 2000: 235, 236) 
c. ek ya, somo ya? 
comeQ NEG Q 


‘Are (you) coming or not?’ (NINJAL 2015) 


The use of the same final particle for both polar and content questions has an areal 
connection to surrounding languages ($6). Alternative and (truncated) polar questions 
are both marked with -he. 
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(5) Ainu (Saru) 
matci=he e=kor rusuy tampaku=he e=kor rusuy? 
match=Q 2sc.S=have want cigarette=Q 2sc.S=have want 


‘Do you want the matches or the cigarettes?’ (Tamura 2000: 234) 


(6) a. toanpe ekte. 
that thing hand.over 
‘Give that to me: 
b. tan pe-he? 
this thing=Q 
"Ihis one?' (Tamura 2000: 234) 


In one recorded open alternative question, perhaps because -he cannot combine with 
interrogatives, only the first alternative takes the marker =he. 


(7) Ainu (Saru) 
te ta=he, hunak ta? 
here Loc=Q where Loc 


‘Here or where? (NINJAL 2015) 


An additional marking of polar questions grammaticalized from nominalization has par- 
allels in Japanese (see §5.6.2). According to Bugaeva (2012: 497) the final copula ne may 
be omitted following the evidential infinitive marker ruwe (ne) ‘it is a fact that’, which 
in turn seems to mark polar questions. The same pattern can be observed in the Chitose 
dialect (Bugaeva 2004: 85). Tamura (2000: 233) claims that the same development is also 
possible with other evidential markers, notably hawe (ne) ‘it is said that’ and siri (ne) ‘it 
looks that’. It appears that the newly grammaticalized question markers may be present 
in polar, alternative, and content questions. 


(8) Ainu (Saru) 
a. yosiku e=ne ruwe? 
PN 2sc.S-coP Q 
"Are you Yoshiko?’ (Bugaeva 2012: 497) 


b. na tuyma ruwe, hanke ruwe? 
still far Q be.close Q 


‘Ts it far or near?’ 


c. makanak pak-no sir-tuyma ruwe? 
what till-apv appearance-be.far Q 


‘How far is it?’ (NINJAL 2015) 


The translation of the three markers above was taken from Bugaeva’s (2012: 494) descrip- 
tion, which contains yet another evidential marker humi (ne) ‘it feels that’ (see 17 and 24 
below). The nominalizers or evidential markers transparently derive from nouns, namely 
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“inferential ruw-e (< ‘the trace of’), reportative haw-e (< ‘the voice of’), non-visual (= sem- 
blative) hum-i (< ‘the sound of"), visual sir-i (< ‘the sight of’)” (Bugaeva 2012: 470). The 
evidential markers also appear in what seem to be tag questions, where they are followed 
by somo ne (ya). The use of the question marker is optional. 


(9) Ainu (Saru) 
e=sinki ruwe somone (ya)? 
2sG.S=be.tired INF.EV NEG COP Q 


"You're tired, aren't you?’ (Tamura 2000: 233) 


In addition to the question markers mentioned above, there is a special copula an that 
replaces the plain copula ne in questions. Special interrogative copula forms are also 
known from several Mongolic languages (85.8.2) as well as Shuri (85.6.2). 


(10) Ainu (Saru) 
núman hunna ek ruwe an? 
yesterday who ` Come SG INF.EV COP.Q 


‘Who came yesterday? (Bugaeva 2012: 497; Tamura 2000: 237) 


For the Chitose dialect Bugaeva (2004: 88) mentions the fact that the special copula 
is usually encountered after one of the evidential markers mentioned above, though see 
example (17b) for a counterexample of the Tokachi variety. The copula can also appear 
twice in alternative questions. 


(11) Ainu (Saru) 
ooho pet an, ohak pet an? 
be deep river coP.Q be.shallow river cor.o 


‘Ts it a deep or a shallow river?’ (NINJAL 2015) 


According to Batchelor (1905: 141), the enclitic =he “expresses interrogation, and is 


»» 


often though by no means always, followed by the verb an 'to be: 


(12) Ainu (Saru) 
tan kur aynu itak eraman kur=he an? 
this person PN language know person=Q CoP.Q 


‘Does this person understand Ainu?’ (NINJAL 2015) 


A question construction specialized for inquiring about topics is hike (mak)? ‘how 
about’ (Tamura 2000: 237), which appears to take a sentence-final position. As we will see 
in §5.1.3, mak is actually an interrogative meaning ‘how, why’, while hike is a conjunction 
with the meaning ‘and’ (Bugaeva 2012: 497). 

In the Chitose dialect, few polar questions are marked with rising intonation alone. 
In most cases it is combined with the same final question marker ya as seen above. 
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(13) Ainu (Chitose) 
tan-to | e-nepki humi pirka a ya? 
this-day 2sc.S=work Ev.N be.good PFV.PL Q 
‘Did you work well today?’ (Bugaeva 2004: 85) 


Content questions in the Chitose dialect are also said to exhibit the marker ya more often, 
as compared with the Saru dialect (Bugaeva 2004: 86). There are, nevertheless, content 
questions without the marker. 


(14) Ainu (Chitose) 
eani hunna e=ko-ysoytak? 
2sG who 2sG.S-to.APPr-talk 


"Who are you talking to?' (Bugaeva 2004: 86) 


Alternative questions have the same particle he as seen in the Saru dialect above. But 
Bugaeva mentions an example of an alternative question which in addition exhibits the 
question marker ya following each alternative. Altogether there are thus four question 
markers. Apparently, =he attaches to the focus, while ya can be found in final position 
after each alternative. 


(15) Ainu (Chitose) 
seta=he ne ya, kamuy-he ne ya? 
dog-Q CopO god-o COP Q 
'Is it a dog or a god?' (Bugaeva 2004: 88) 


The Shizunai dialect also has the question marker ya in sentence-final position, which 
seems to have the same semantic scope as seen before. 


(16) Ainu (Shizunai) 
numan ekasi nep kar ya? 
yestderday old.man what do o 


"What did the old man do yesterday?' (Refsing 1986: 229) 


As in the Saru and Chitose dialects, there is a connection of questions to nominaliza- 
tions, i.e. ruwe, siri, hawe, and pe. The first three correspond to the Saru forms mentioned 
above while the last one is similarly neutral like ruwe. The difference between the two 
is the level of abstractness, pe referring to concrete and ruwe to abstract objects (e.g., 
Refsing 1986: 229f.). The copula an is attested as well. 

A recent treatment of the Tokachi dialect in southeastern Hokkaido mentions sev- 
eral questions that exhibit no significant difference from the other dialects already men- 
tioned. 
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(17) Ainu (Tokachi) 

a. nep tap eci-ki sir okay ya? 
what EMPH 2PL.A-do EV.N COP Q 
‘What are you doing?’ 

b. oupeka ekaci e=cakoko a ru etap an? 
straight child 2sG.A=teach PFV.PL EV.N EMPH COP.Q 
‘Did you teach the children correctly?’ 

c. e=cikir-i arka ru=he? 
2sc.S=foot-poss hurt EV.N=Q 
‘Are your feet hurting?’ (Takahashi 2013: 131) 


Based on this similarity, one may speculate that alternative questions presumably dis- 
play double marking with =he and that the question marker ya also marks polar ques- 
tions. Tokachi Ainu has yet another question marker a not encountered thus far. In all 
examples given, it follows the copula an and marks content questions. 


(18) Ainu (Tokachi) 
nen tap apustakik human a? 
who EMPH door knock Ev.N COP.Q Q 


‘Who knocked on the door?’ (Takahashi 2013: 131) 


Apparently, the marker also exists in other dialects such as Saru. The following exam- 
ple illustrates that it can also appear in polar questions. 


(19) Ainu (Saru) 
arki rok a? 
come.PL PFV.PL Q 


‘Have they come?’ (Shibatani 1990: 79) 


Other Hokkaido dialects seem to exhibit a pattern very similar to those already ob- 
served, though there usually is only little information available. For example, the Samani 
dialect also has the marker ya and the special interrogative copula ’an, but additional 
information on further question markers and their semantic scope remain obscure (T. 
Tomomi 2002: 101, 107). 

For Sakhalin Ainu, the materials collected by Konada (Tittel 1922) contain the three 
question markers a, ya, and he. We have already encountered all three markers above in 
several Hokkaido dialects. Their semantic scope remains unclear but may be similar to 
Hokkaido dialects as well. 


(20) Ainu (Sakhalin) 


a. pirika a? 
be.good Q 
‘Is it alright?’ 
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b. tam-pe ne ya? 
this-thing cop Q 
‘Ts it this thing?’ 
c. e-nu-he? 
2sc.A=hear=Q 
‘Do you hear?’ (Tittel 1922: 85) 


For the Kuril dialect of Ainu, there is a content question that was originally recorded 
by Voznesenskii. Apart from the interrogative, no marking is present. 


(21) Ainu (Kuril) 
nie-bie-gor? 
what-thing-have 
"What thing is there?’ (Vovin 1993: 199) 
No information on other question types in this dialect group seems to be available. 
Table 5.1 summarizes the limited information of Ainuic question marking that we 


have seen above. The semantic differences between different markers of polar questions 
as well as the exact semantic scope for most forms remains obscure for now. 


Table 5.1: Tentative summary of question marking in Ainuic 


Language PO CQ AQ 

Chitose Ainu ya yas 2x =he (+ 2x ya#) 
Kuril Ainu ? ?- 2 

Sakhalin Ainu yatt, at =he# ? ? 

Saru Ainu yaf, a# =he#, an coP.Q, Nf — yaf, an COP.Q, NÉ 2x -he, 2x ya#, 2xN# 
Shizunai Ainu yas, an COP.Q yas, an COP.Q ? 

Tokachi Ainu ?ya#, an COP.Q ya#, a#, an COP.Q  ? 


As usual, most question markers remain etymologically opaque, but Ainuic ya could 
be somehow related to Old Japanese =ya (§5.6.2). A problem for the comparison is, 
however, a different morphosyntactic behavior and semantic scope of the Old Japanese 
marker that is a mobile enclitic not found in content questions. 


5.1.3 Interrogatives in Ainuic 


The sets of interrogatives in the three dialects Saru, Chitose, and Shizunai mentioned in 
the previous section are very similar to each other (Table 5.2). For the Tokachi dialect 
Takahashi (2013) only mentions nen ‘who’, nep ‘what’, nekon ‘how’, and onon ‘whence’. 

From a synchronic point of view, the interrogatives are mostly opaque, but at least 
some forms are readily analyzable. The form nep kusu ‘why’ from the Chitose dialect 
consists of nep ‘what’ and kusu ‘because’. The Shizunai dialect in this expression has 
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Table 5.2: Saru (Tamura 2000; NINJAL 2015), Chitose (Bugaeva 2004), and 
Shizunai interrogatives (Refsing 1986) 


Meaning Saru Chitose Shizunai 
who hunna hunna hunna, nen 
where (to) hunak (ta), hinak (ta) hunak hunak, neyta 
what hemanta, hńta hemanta ‘what, why’ hemanta 
how much/many hempak hempak 
when hempara hempara hempara 
why hemanta ne nep (kusu) nepkus (ta) 
which inan, (h)inaan inaan 

how, why mak, makanak neun, makanak nekon 


Table 5.3: Sakhalin Ainu interrogatives according to Bronisław Piłsudski (Ma- 
jewicz 1998: passim) with tentative additional analysis based on Shibatani 
(1990) and Bugaeva (2012) 


Meaning Piłsudski 
which hemanu 

what hemat 

what, why, whose hemata ~ hematu 
why (kusu ‘because’) hemata kusu 
wherefore (kusu ‘because’) hemata ki-kusu 
how many hempak 

when, how hempara 

who inki an-ku(ru) 
whither nakan 

what nex 

what (ta ‘LOC, ALL’) nex-ta 

what, where from nejava 

where (ta ‘LOC, ALL’) nejta 

what nep 

what, how temana 

what, how, in what way temana-ka 
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an optional locative/allative case marker ta. Saru has a different formation based on 
the interrogative hemanta ‘what’ followed by the translative (Shibatani 1990: 36) or mu- 
tative (Bugaeva 2012: 476) marker ne that derives from the copula. Most forms have a 
resonance in h~. Perhaps, Ainuic thus not only belongs to the group of languages that 
have what has been called the KIN-interrogative (e.g., Saru hunna), but also exhibits K- 
interrogatives (86.2.1). However, the presence of the forms hunna(k) ‘who’ and hunnak- 
ta ‘where’ (Batchelor 1905), the latter with locative case marker, suggests that the form 
underlying both may have been a selective interrogative. Table 5.3 lists some Sakhalin 
Ainu interrogatives as recorded by Bronislaw Pilsudski. For the Sakhalin dialect, Tittel 
(1922: 77) only mentions a handful of forms that are more or less identical with those 
listed in Table 5.3. These data clearly show that there are also resonances in m- and 
especially n- as well (see also Batchelor 1905). 

Vovin (1993) reconstructs four interrogative stems for Proto-Ainuic, *gEm-, *gu[n]na, 
*in[a]-, and * nEE-, but the situation seems to be much more complicated than that. Alto- 
gether he assumes seven interrogatives that are based on these stems as “gEm= is thought 
to be the basis for the three different interrogatives “gEm=an=ta ‘what’, *gEm=pa=ra 
‘which’, and *gEm=pak=pE ‘how many’ (e.g., Horobetsu hemanta, henpara, and henpakpe), 
which is in accordance with Cysouw’s (2005) typology and suggests an original meaning 
‘which’ or maybe ‘what’. However, there are several problems with Vovin’s reconstruc- 
tions. Vovin does not comment on the morphology he reconstructs. The use of the equal 
sign instead of the usual hyphen for morphemes remains unclear as well. Furthermore, 
it is rather questionable whether an original bilabial nasal m should have developed into 
an n followed by a bilabial plosive in all dialects but one. In fact, exactly the opposite 
development would be expected. Perhaps the same is true for the initial consonant *g- 
that in almost all dialects mentioned has the form h-. Similarly, except for one dialect, 
the alleged interrogative *nEE= actually always has the form ne. The stem ne is said to 
mean both ‘who’ and ‘what’, which is rare from a typological perspective, but seems 
possible (Cysouw 2005; 2007). The interrogative "in[a]- appears to be mistaken, as there 
may have been an original initial consonant, e.g. Saru (h)inaan ‘which’. It may also be 
noted that Vovin's (1993) list of cognates is not exhaustive. There is an older but more 
complete description of interrogatives by Asai (1974: 64f.) that is given in Table 5.4. Fig- 
ure 5.1 indicates the geographical distribution of the personal interrogatives. 
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Table 5.4: Distribution of forms among dialects after Asai (1974: 64f.); 1 = 
Yakumo, 2 = Oshamambe, 3 = Horobetsu, 4 = Piratori, 5 = Nukibetsu, 6 = Ni- 
ikappu, 7 = Samani, 8 = Obihiro, 9 = Kushiro, 10 =Bihoro, 11 = Asahikawa, 12 = 
Nayoro, 13 = Sóya, 14 = Ochiho, 15 = Tarantomari, 16 = Maoka, 17 = Shiraura, 


18 = Raichishika, 19 = Nairo, 20 = Kuril, 21 = Chitose 


Meaning 


who 


what 


how 


when 


where 


Form 


nen 
hunna 

hunat 

naat 

nep 

hemanta 

hemata 

hinta 
nejanokonejanpi 
nekona 

nekon 

mak 

manak, makanak 
temana 

uiman 

henpara 
henparakanne 
henpaki 
nenpara 

nejta 

nahta 

hunakta 
hunakun 

huija 


Dialects 


1-3, 7-13 
4-6, 21 
20 

14-19 
1-3, 7-14 
3, 4, 6, 21 
14-19 

5 

20 

1-3, 9, 10 
7-13 

4-6 

21 

14-19 

20 

1-6, 11, 12, 14-16, 18, 21 
17, 19 

20 

7-10, 13 
1-3, 7-13 
14-19 

4, 6, 21 
5,21 

20 
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Kamchatka 


20 hunat 


Sakhalin 


Kuril 
14-19 naat Islands 


Hokkaido 


1-3, 
7-13 
nen 


4-6, 21 
hunna j^ 


Ainu 
topo- 
nyms 


Honshü 


Figure 5.1: Distribution of forms meaning ‘who’ after Asai (1974: 64f.) 


5.2 Amuric 


5.2.1 Classification of Amuric 


Nivkh is usually considered a linguistic isolate (e.g., Anderson 2006c), but there may 
be some reason to assume a connection to Chukotko-Kamchatkan languages (Fortescue 
2011) (85.3). Apart from that, there is perhaps enough internal variation to consider it a 
small language family that will be called Amuric (Janhunen 1996). However, these vari- 
eties are traditionally called dialects instead of languages (Gruzdeva 1998: 7). The relation 
of these so-called dialects has been characterized by Gruzdeva (1998: 7) as follows: 


AD and EspD are rather different: their speakers affirm that they do not understand 


each other. Nsp (or the Shmidt dialect) occupies an in[t]ermediate position be- 
tween these two. As for ssp (or the Poronaisk dialect), it has essential differences 
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in phonology, grammar, and vocabulary from the other three dialects, especially 
from AD. 


The Amur dialect has also been spoken on northwestern Sakhalin. Shiraishi (2006) has 
additionally argued for the existence of a West Sakhalin dialect (WSD) that is different 
from, but closely related to the Amur dialect (see also Shiraishi & Tangiku 2013). This 
has not been recognized by Fortescue (2016). In sum, there are the following varieties. 


-——— Amur dialect (AD) 
r—— Amur-West-Sakhalin + 


— West Sakhalin dialect (WSD) 


— North Sakhalin dialect (NSD) 
(22) | 


— East Sakhalin dialect (ESD) 


— South Sakhalin dialect (SSD) 


Most examples will be drawn from AD and ESD. The somewhat obscure transcription 
of some publications has been changed and roughly follows Shiraishi & Tangiku (2013: 
203). 


5.2.2 Question marking in Amuric 


According to Gruzdeva (1998: 45), Nivkh makes a distinction between two types of polar 
question markers. The first type is a suffix that directly attaches to the verb stem and has 
the form Jo! in both Amur and East Sakhalin dialects. The form -lo is more polite and 
ceremonious than -/, which seems to have a more colloquial flavor (Nedjalkov & Otaina 
2013: 116). 


(23) Nivkh 
tfi ra-l(o)? 
2sc drink-FIN.Q 
‘Did you drink?’ (Gruzdeva 1998: 45) 


(24) Nivkh (Amur) 
if p’ri-I(o)? 
3sc come-Q 
‘Did (s)he come?’ (Nedjalkov & Otaina 2013: 116) 


The second type attaches to a finite verbal form or other elements in focus. It has the 
form =I(a) ~ =lo in the Amur dialect and the form =/(a) ~ =lu in the East Sakhalin dialect. It 
was also written with a hyphen but is reanalyzed as enclitic here. The semantic difference 
between the two markers, which are perhaps etymologically connected, remains unclear. 
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(25) Nivkh (Amur) 
tfi ra-d-la? 
2sG drink-FIN=Q 
"Did you drink?' 


(26) Nivkh (East Sakhalin) 
tfi ra-d-lu? 
2sc drink-FIN=Q 
‘Did you drink?’ (Gruzdeva 1998: 45) 


Polar and focus questions have the same marker that attaches to the verb in the former 
and to the element under focus in the latter. 


(27) a. Nivkh (Amur) 
itik — p'ri-d3-la? 
father come-IND=Q 
"Has father come?’ 

b. itik-la  p'ri-d3? 
father=Q come-IND 


‘Ts it father who has come?’ (Nedjalkov & Otaina 2013: 124) 


Content questions may be unmarked if they have a special intonation that was left un- 
specified by Gruzdeva (1998: 46). Otherwise they have a question marker different from 
that for polar questions (Amur -7a, =at(a), East Sakhalin =na, =nu, =ara). The markers 
may either attach to the verb or the interrogative (phrase). They have been reanalyzed 
as enclitic here. Interrogatives remain in situ. 


(28) Nivkh (East Sakhalin) 


a. tf'in t'amdsi  p"-vo-ux tamdsi na tur'"-pir'k 
2PL what.kind nErr-village-1oc what.kind animal meat-?only 
fi-t^a-d-yun? 
eat-HAB-IND-PL 
"What kinds of animal meat do you eat in your village?' (Chae 2013: 132) 
b. tf'i ta.k-toy | vi-d- ga? 
2sG where-DAT go-IND-Q 


"Where are you going (roughly)?’ (Gruzdeva 2008: 182) 


(29) Nivkh (Amur) 


a. ag p'ri-d5-at? 
who come-IND=Q 


"Who came?’ 
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b. tfi sidz=ya j-isru-d3? 
2sG what=Q OBJ-pursue-IND 
‘Whom do you pursue?’ (Gruzdeva 1998: 46) 


(30) Nivkh (Amur) 
sid3 pivy-ga jiv-d3? 
what person-Q have-IND 
“What (kind of) man is (here)? (Nedjalkov E Otaina 2013: 124) 


The existence of separate and overtly marked polar and content question markers seems 
to have been adopted by the Tungusic language Uilta (§5.10.2). 

No clear examples for tag questions and only one example for a negative alternative 
question have been found. The analysis of this example from von Glehn (Grube 1892: 31) 
remains partly obscure for me but is sufficiently clear to show that there is no disjunction 
and that each alternative takes a marker lo. In the Amur dialect this may either corre- 
spond to the enclitic =l(a) ~ =lo or to the suffix -I(o). However, Nedjalkov & Otaina (2013: 
125, 209) mention a suffix -lu found in the Amur dialect, misleadingly called “particle” 
despite being given with a hyphen, that seems to have dubitative meaning and marks 
indirect alternative questions. Given that it may also have the form -lo, it seems possible 
that this is the form recorded by von Glehn. 


(31) Nivkh (Amur) 
[tu-ni-d3-lu qa-ni-d3-lu] planp'ara-r 
go.upstream-FUT-N-DUB go.downstream-FUT-N-DUB not.know-CVB.NAR.3sG 
hum-d3. 
be-IND 
‘He does not know [whether to go upstream or downstream]. (Nedjalkov & 
Otaina 2013: 209) 


An etymological connection to the other two question markers seems likely but to my 
knowledge there has not been an investigation of this topic. The same marker also ap- 
pears in indirect polar questions (Nedjalkov & Otaina 2013: 220) and content questions 
(see 33a,b below). This quite clearly shows that it should be kept apart from the actual 
question markers. On the contrary, it may be a marker for indirect questions, exclusively. 
Rhetorical questions in Nivkh are marked with -rla ~ -tla. 


(32) Nivkh (Amur) 
if p’ri-rla? 
3sc come-Q 
‘Did (s)he really come?’ (Nedjalkov & Otaina 2013: 116) 


A special marker that is said to expect a positive answer and thus perhaps comes close 
to a question tag is (probably sentence-final) <y> as recorded by von Schrenck (Grube 
1892). Austerlitz (1956: 262) mentions a marker =ii, reanalyzed as enclitic here, that he 
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translates as ‘isn’t it?’ and it might be the same as <y>, e.g. nav=ii? ‘a sparrow’s nest, 
isn’t it?’. The Tungusic language Uilta (§5.10.2) not only has a content question marker 
-ga ~ =ka that most likely derives from Nivkh =na (§3.1), but also has a polar question 
marker =(y)i that could to stem from this enclitic in Nivkh. 


Table 5.5: Summary of question marking in Amuric. 


PO CO AQ FQ 
AD V=l(a)/=lo =ya, -at(a) 2x =l(a)/=lo, ?  FOC-l(a)/-lo 
2x =lu/=lo 
ESD V=l(a)/=lu =ya,=yu,=ara ? FOC=|(a)/=lu 


Slightly adjusting Fortescue’s (2016: 79, 172) reconstructions, Proto-Amuric must have 
had the question markers *=la ~ =lo, *-rla ~ -rlo, *=na, =ata, and *-i with somewhat 
unclear distribution. 


5.2.3 Interrogatives in Amuric 


Descriptions of interrogatives in Nivkh are usually insufficient, especially for the South 
and North Sakhalin dialects. Table 5.6 shows those forms collected by Mattissen (2003) 
and Fortescue (2016) to which WSD data has been added (Shiraishi & Tangiku 2013). The 
Amur and West Sakhalin dialects have a resonance in ř~ and the East Sakhalin dialect 
in t^- that go back to the same origin. Interrogatives meaning ‘what’ and ‘when’, and, 
except for ESD, also the interrogative meaning ‘who’ do not participate in this resonance. 
The resonance has been recorded as §~ by von Schrenck and as s~ by von Glehn (Grube 
1892). For example, von Schrenck had a form ša- ‘which, what kind of’ (AD řa-) as well 
as its regular locative form ša-in ‘where’ (AD fa-in, Fortescue 2011: 144). 

Fortescue (2016: 111) speculates that AD ar) derives from nar-na ‘who-q’. If correct, a 
typological parallel can be found in Korean (§5.8.3). ESD t*au-nt/-d ‘who’ is perhaps a 
secondary innovation based on the selective interrogative. Interestingly, almost all listed 
interrogatives are monosyllabic. But there are some longer forms as well, as the following 
two examples from the Amur dialect illustrate. 


(33) Nivkh (Amur) 
a. [jagut | imr p'ri-d5-lu] if p'anp'ara-d;. 


how.3PL 3PL come-IND-DUB 3sG not.know-IND 
‘He does not understand [how they came there]: 
b. [jagur p'ri-dg-lu] —p*anp"ara-d3. 
how.3sG come-IND-DUB not.know-IND 
‘(He) does not understand [how (he) came (there)! (Nedjalkov & Otaina 2013: 
220) 
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Table 5.6: Nivkh interrogatives according to Mattissen (2003: 14) and Fortes- 
cue (2016: passim), WSD according to Shiraishi & Tangiku (2013: 206); not all 
variants listed 


WSD 


Proto- AD ESD NSD SSD 

Amuric 

“nar an ~ ay ar nar ~ nař, nar ~ nař nat 

‘who’ tau-nt/-d 

*tu-nt si-dz si-tf ~ si-d ru-(n)t/-d ru-t, řu-t, ru-nt, lu-nt 
‘what’ su-t 

*ta-nt řa-dz ~ třa-dz  ?! t"a-d ? ? 

‘which’ 

“tanz~“tanr =‘ fa-ns fans Dans ~ ta-gs řa-ņspaklu tha-nk ~ fa-nkf 
‘how much/ ‘some’ ‘some’ 

many’ 

*ta- fa-r fa-y-fa-g — t"a-s ? fa-k, t"a-k 
‘where’ 

“ayr iyr iyr ayř ~ iyi irga axt 

‘when’ 

*ja-(ni-n) ja-gu-t/-r jaņ-gu-ni-tf ja-ni-ja-nf ja-na-gu-t ja-ni-n, jan-f, 
‘how, why’ ja-ni-g 


Also observe the dubitative suffix -lu used for indirect questions presented in §5.2.2. 
In jagur ~ jagut the element -r (2sc, 3sc) ~ -t (1sG, 1PL, 2PL, 3PL) is the narrative converb 
marker that is also part of the rhetorical question marker -r-la ~ -t-la previously noted 
(Nedjalkov & Otaina 2013: 40). The forms also contain an old causative marker -ku ~ -yu 
~ -gu ~ -xu that apparently has mostly lost its function (Nedjalkov & Otaina 2013: 42). Ap- 
parently, Gruzdeva (1998) and Mattissen (2003) do not mention any of these forms, but 
they have been listed as ja-ge-r (von Schrenck), jag(-o-r) (von Glehn), ja-g-r (Seeland), 
and jan-g-r (Lebedew) by Grube (1892). As in two of these examples, AD and WSD some- 
times contain a consonant -n which—Nedjalkov & Otaina (2013: 87) speculate—might be 
a dialectal difference. According to Fortescue (2016: 81), the -7 could be a participle form. 
Table 5.7 shows the paradigm of these forms as can be reconstructed with the help of 
different descriptions. 

But according to Shiraishi & Tangiku (2013: 206) there are also some longer forms such 
as WSD jap-gu-ni-tf ‘how’. The WSD suffix -tf is the same as AD -d3 ‘IND’ that attaches 
to what appears to be the future marker -ni (Nedjalkov & Otaina 2013: 209) or perhaps 
the verb -ni ~ -nu ‘to do’ as in SSD ja-ni-n (Fortescue 2016: 81). Nedjalkov & Otaina (2013: 
369) mention in addition an AD form jaar ‘why’ that must be related to these forms 


1Given the parallel in the AD and ESD, one may assume that the WSD has the form řa-tf ~ Go-d ‘which’. 
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Table 5.7: Simple AD and WSD interrogative paradigms of the form meaning 
‘how’ (Nedjalkov & Otaina 2013: 40, 220, Shiraishi 2006: 65, Shiraishi & Tangiku 


2013: 206) 
SG PL 
1 ja(y)-gu-t ja(9)-gu-t 
ja(y)-gu-r ja(9)-gu-t 
3 ja(y)-gu-r ja(y)-gu-t 


but has a long vowel and lacks the causative suffix (see Fortescue 2016: 81 for additional 
variants). According to Mattissen (2003: 238) the stem ja- (optionally with a derivation 
ja-y a- not encountered thus far) actually means ‘to do what’. The forms ja- as well as ja- 
go may also be employed as an attribute, e.g. AD ja-nivx ‘what person’, ja-so-daf ‘what 
kind of house’. These patterns are extremely similar to Mongolic (* ya-xu/n ‘what’, *ya-xa- 
‘to do what’, §5.8.3) and Tungusic (*ja-(kun) ‘what’, *ja- ‘to do what’, §5.10.3).? Possibly, 
the Nivkh forms are Tungusic loans that in turn derive from Mongolic. The converbal 
origin of forms meaning ‘how’ or ‘why’ might also suggest a connection with Mongolic 
or Tungusic. Within Nivkh there are completely parallel forms in the demonstratives, 
e.g. AD ho-(so)- ‘be like that, do thus’, ho(n)-gu-r/t ‘thus, in that way’ etc. (Nedjalkov & 
Otaina 2013: 87f.). 

Suffixes in the locative (AD řa-r, ESD t"a-s ‘where’) and the quantitative interroga- 
tives (AD ra-ns, ESD t*a-ns ~ t"a-gs ‘how much/many’) have parallels in spatial expres- 
sions and demonstratives, cf. AD tu-r ‘here’, hu-r ‘there’, tu-ns ‘this much’, hu-ns ‘that 
much’ (Gruzdeva 1998: 26f., 36), ESD tu-s, hu-s, and tu-nks, hu-nks with a slightly dif- 
ferent form (Gruzdeva 2008: 170). Mattissen (2003: 14) furthermore mentions AD řa-kr 
~ t"a-kr ‘where’ that has a suffix also known from spatial expressions and demonstra- 
tives, e.g. ESD tu-k* ‘here’, hu-kř ‘there’ (Gruzdeva 2008: 181). The difference between 
-s and -kř is that the former designates a precise and the latter a non-precise location 
(Gruzdeva 2008: 178). Another suffix -nx roughly patterns with the latter in meaning, 
e.g. ESD t"a-nx ‘where’ (Gruzdeva 2008: 184). It is possible to attach a case marker such 
as the dative to the locative forms, e.g. ESD t"a-s-toy, t"a-k-toy ‘where to’ (Gruzdeva 
2008: 179, 182). Thus, similar to Tungusic the forms meaning ‘where’ are derived from 
the selective interrogative (AD řa-d3 ~ t"a-d3, ESD t”a-d). 

The forms meaning ‘what’ may be analyzed as a stem and the nominalizer (indicative) 
*-nt > AD -d3, ESD -nt ~ -(n)d etc. (Fortescue 2011: 1366). The same element is present in 
the selective interrogative and ESD t"au-nt, t^au-d ‘who’, as well as some demonstratives 
(Table 5.8). Notice that von Schrenck recorded the Amur dialect form meaning ‘what’ as 
si-é ~ si-né (Grube 1892), which preserves a nasal that is also present in ESD ru-d ~ ru-nt 
‘what’. 


? According to Nedjalkov & Otaina (2013: 209) and Fortescue (2016: 81), the initial j- is a third person singular 
marker—a hypothesis first proposed by Jakobson—while a- is the actual interrogative verb meaning ‘to do 
what’. But the connection with Tungusic and Mongolic makes this very unlikely. 
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Table 5.8: Amuric demonstratives and interrogatives “indicating a person or 
an object” (Gruzdeva 1998: 26ff.) 


AD ESD 
DEM.PROX ti-dz tu-d - tu-nt - ti-nt 
DEM.DIST 1 hi-dz hu-d ~ hu-nt ~ hi-nt 
DEM.DIST 2 a-dz ahu-d ~ ehu-d ~ ehi-nt 
DEM.DIST 3 aehi-dz aix-nt 
DEM. invisible ku-dz ku-d ~ ku-nt 
what si-dz ru-d - ru-nt 
which fa-d; ~ Dada Dad 
who - t'au-d ~ t"au-nt 


Demonstratives with the suffix may take number and case markers (e.g., AD ti-d3-O- 
yir ‘this-IND(-sG)-INsT’), without, they may function as attributive forms (e.g., AD ti urk 
‘this night’). Perhaps a similar situation can be observed for the interrogatives t"amdsi 
‘what kind of’ (Chae 2013: 135) versus t"amds5i-d ‘how’ (Fortescue 2011: 1372) in the ESD 
(similar to ja-d3 ~ ja- in AD, Mattissen 2003: 238). 

Fortescue (2011: 1371) assumes that Nivkh t"a-/ra- is related to *ózeq in Proto-Chukotko- 
Kamchatkan (e.g., Chukchi ráq, Alutor taq). He reconstructs a common proto-form for 
both as *ta(q)- (85.3.3). But as long as the hypothetical language family is not accepted by 
a majority of scholars, this must be treated with caution. Two interrogatives from Nivkh 
may have found their way into the Tungusic language Uilta (§5.10.3). The Uilta materials 
collected by Bronislaw Pilsudski contain the two forms nuulá ‘whither’ and sádo ‘where’ 
(Majewicz 2011: 388, 430). The second interrogative also has the form saa ‘where’ with a 
long vowel and is most likely a loan from West Sakhalin Nivkh ra-g ‘where’ (cf. Ikegami 
1997; Pevnov 2009: 122). Note that von Glehn recorded several forms starting with s- 
(Grube 1892). Allegedly, the ESD also has the forms nu-nt ~ nu-d ‘what’. Fortescue (2011: 
1372) speculates that these forms are actually indefinites and may contain a contracted 
form of the noun nə- ‘thing’. But if Uilta nuulu is indeed from Nivkh, it must be connected 
somehow to this form in the East Sakhalin dialect. 


5.3 Chukotko-Kamchatkan 


5.3.1 Classification of Chukotko-Kamchatkan 


Chukotko-Kamchatkan (or Luoravetlan) is a small family that includes five languages in 
two different branches (Fortescue 2003: 51f.; Anderson 20062). 
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Alutor 


Chukchi 


Chukotian 4 


*Kerek 


(34) | — Koryak (Nymylan) 


— "Eastern 


L Itelmen (Kamchadal) — *Southern 


— Western 


Itelmen formerly consisted of three different languages or dialect groups, of which all 
but the Western group have already become extinct. Kerek disappeared during the 1990s. 
Recently, it has been proposed that Amuric (85.2) may be distantly related to Chukotko- 
Kamchatkan (Fortescue 2011), but this hypothesis remains unproven. 


5.3.2 Question marking in Chukotko-Kamchatkan 


Given the lack of data on other question types, the following will focus primarily on 
polar and content questions. Alutor marks polar questions by means of probably rising 
intonation and an optional question particle. Unlike most other languages treated in 
this study, the particle does not stand sentence-finally but initially, which, except for 
some Indo-European languages (85.9.2), represents a stark contrast with NEA (Chapter 
6). Content questions have no question particle. 


(35) Alutor 
a. matka ta-lyu-ni? 
Q pot-kill.wild.reindeer-pot.pFv[3sc.S] 
"Will he kill a wild reindeer?’ 
b. miyya iv-i? 
who.ABS.SG say-3SG.S[PFV] 
"Who said (that)?’ (Nagayama 2011: 293, 294) 


In all Chukotko-Kamchatkan languages, interrogatives seem to take sentence initial po- 
sition, which likewise differentiates them from the rest of NEA. Interestingly, the initial 
question particle itself looks similar to Chukotko-Kamchatkan interrogatives starting 
with m~ (see §5.3.3). Fortescue (2005: 416) translates matka as ‘or’ and lists it with forms 
such as Chukchi mec- ‘somewhat’. While the exact derivation remains unexplained, there 
is also a Koryak form met(’)ke ‘or’ that appears to be a direct cognate of Alutor matka. 
Content questions are likewise unmarked in Koryak. 
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(36) a. Koryak 
met ke jenny e-jem-ke? 
Q maybe NEG-come-CIRC 
‘Perhaps (she) does not come?’ 
b. meki ib-i? 
who.ABS.SG say-3SG.S[PFV] 
‘Who said (that)?’ (Zhukova 1997: 51) 


It seems that in Chukchi and Kerek both polar and content questions are generally 
unmarked. 


(37) Kerek 
a. mil'ej jallaju-ni-n? 
gun take-PRr-3sc.O 
"Will you take this gun?' 
b. maki jaté-i? 
who.ABS.SG come-PRF 


"Who came?’ (Volodin 2001: 156, 157) 
(38) Chukchi 


a. koolo enmec ye-yjew-iyat? 
INTJ already Pr-awaken-2sc 
'My goodness, you're up already?' 
b. tite nan  potqen n-a-qit-a-qin? 
when DEICT DEM.3SG.ABS HAB-E-freeze-E-3SG 
"When does it freeze there?’ (Dunn 1999: 86, 72) 


Chukchi furthermore has an element otlon, glossed as a question marker, that appears 
in both polar and content questions and was translated as ‘on earth’, i.e. it adds a certain 
emphasis. It may also fuse with the interrogative 2mi ‘where’ to form the more complex 
emphatic interrogative 2mitlon ‘where on earth’ (Dunn 1999: 289f.). Its syntactic position 
is not absolutely clear, however, but seems to be relatively free. 


(39) Chukchi 
ana kakel, atlon (om, req-o-l'et-a-rko:n? 
so INTJ Q why what-E-DUR-E-PROG.VOC 


‘Oh my! Why, what on earth are you doing?’ (Dunn 1999: 55) 


It does not seem to be a true question marker, but nevertheless appears in interrogative 


contexts. Functional equivalents can be found in Yiddish (§5.5.2.2) and Tundra Nenets 
(85.12.2). 


Polar questions in Itelmen have final rising intonation but otherwise are identical to 
equally unmarked content questions. 
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(40) Itelmen 
kni-n qitkinen ci-ze-n? 
PP.2sG-POSS brother be.available-PRs-3sc 
‘Do you have a brother?’ (Georg & Volodin 1999: 214) 


Interrogatives in content questions optionally take a suffix -s, which is said to be a 
question marker that expresses additional emphasis. 


(41) Itelmen 
a. Ke c'e]-en kol-ki? 
who coM.v-2s6.3sG come-INF.V 
"With whom have you come?' 
b. manke-s kamma-n | mni] k-mec-knen? 
where-Q PP1sc-Possall iwr.III-disappear-crc 


"Where have all my (people?) gone?' (Georg & Volodin 1999: 134, 214) 


Itelmen is the only Chukotko-Kamchatkan language for which descriptions of focus 
questions are available to me. They follow an intriguing pattern that has a variable per- 
sonal marker on the verb. 


(42) Itelmen 
a. isx-enk  n-zal-at-in kza kama-nk? 
father-Loc 1MPRS-give-FUT-2sG.O0BJ.O you me-DAT 
"Will father give you to me?' 
b. isx-enk | n-zal-at-um kza kama-nk? 
father-Loc 1MPRs-give-FUT-1sG.08J.IO you me-DAT 
"Will father give you to me?' (Bobaljik & Wurmbrand 2002: 3) 


In this example, either the direct or the indirect “object” are represented with an agree- 
ment marker on the verb. The presence of the marker expresses the focusing of the re- 
spective constituent. 

Chukotko-Kamchatkan languages have a strong interaction of imperatives and ques- 
tion marking, which is yet another untypical feature for NEA. For example, Nedjalkov 
(1994: 325) mentions the interesting fact that imperative verb forms in Chukchi may 
appear in content questions where their meaning changes to marking future tense. 


(43) a. Chukchi 
myn-le-rkyn? 
IMP.ÍPL-gO-IPFV 
‘Let us fly!’ 
b. minky.ty myn-le-rkyn? 
over.where IMP.1PL-g0-IPFV 


‘Over what place shall we fly?’ (Nedjalkov 1994: 325) 
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Georg & Volodin (1999: 171) claim that imperatives in Itelmen may also have a future 
and prospective meaning, but this does not appear to be restricted to questions. The 
phenomenon in Chukchi has a more straightforward parallel in the more closely related 
language Kerek, for which Volodin (2001: 158) noted the following phenomenon: 


Interrogative sentences in Kerek are often viewed as a special type of imperative 
utterances that presuppose a speech response. Any interrogative sentence can be 
interpreted as a reduced imperative sentence of the type “Tell (answer) me, if... 
This view may be confirmed by the strong formal ties existing between impera- 
tive and interrogative meanings demonstrated by Chukchi-Koryak (and Chukchi- 
Kamchatkan) languages. 


In both Kerek and Chukchi the imperative markers in questions exhibit an additional 
modal overtone such as ‘can’ or ‘must’ (Volodin 2001: 157). 


(44) Kerek 
manka na-xaxau-n? 
why ` 1MP.3sG-go-3sc.S 


"Why does he have to go?' (Volodin 2001: 156) 


The imperative marker is not obligatory, however, and as in Chukchi all examples pro- 
vided by Volodin are content questions. Whether this feature is shared by Alutor and 
Koryak remains unclear for now. Interestingly, interrogative morphology in the adja- 
cent Yukaghiric languages (see $5.14.2) as well as in Central Alaskan Yupik (85.4.2) is 
also restricted to content questions. See also §5.10.2 on Even, a Tungusic language that 
had contact with Chukotko-Kamchatkan and exhibits the use of imperative forms in 
questions as well. 

The marking of questions in Chukotko-Kamchatkan summarized in Table 5.9 exhibits 
no similarities to Amuric or to most of NEA, for that matter. 


Table 5.9: Summary of question marking in Chukotko-Kamchatkan 


Language PO CQ 
Chukchi - - (IMP-V) 
Alutor #matka - 

Kerek - - (IMP-V) 
Koryak #met’ke - 
Itelmen - (#INT-s) 


5.3.3 Interrogatives in Chukotko-Kamchatkan 


Several Proto-Chukotko-Kamchatkan (PCK) interrogatives have been reconstructed by 
Fortescue (2005). Table 5.10 lists them with cognates from all five languages, but not all 
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variants and only singular forms are shown. Each language has some additional forms, 
e.g. la’ Isxe’n ‘how much/many’, manke ‘whence, how’, manx'al ‘whither’, anga ‘what’, 
and anqan-kit ‘what-caus > why’ in Itelmen (Georg & Volodin 1999: 136, passim), man- 
ki, maja ‘where’, man-kat(in) ‘whence’, man-kepan ‘whence, along where’, mar-in'as 
‘how many, how long’, and tafar ‘how much’ in Alutor (Nagayama 2011: 293f.), and ‘emi 
‘where’, i'am ‘why’, mik-a-ne ‘whither’, t'er ‘how much/many’ etc. in Chukchi (Dunn 
1999: 66, passim). The most important resonance of Chukotko-Kamchatkan languages is 
m-. 


Table 5.10: Proto-Chukotko-Kamchatkan (PCK) interrogatives and their cog- 
nates in individual languages according to Fortescue (2005: 56, 173, 175ff., 287) 
and Dunn (1999; 2000) 


Meaning PCK Chukchi Kerek Koryak Alutor Itelmen 
whatkind *mæņin menin marin merin marin min 
who "mike mik(a)- maki meki, miyya Ke 
(ABS.SG) maki 
(Kamen) 
where *"mig(ko) minke mirkiil minkə minkə ma?, mank 
(ALL) 
how *"migkeói  migkori minkii minkaje ?manket ?mank 
what *ðæq- req-/ceq- jaq- jeq- taq-, teg- saq 
(Palana) 
when *titæ tite sita tite tita it’e 


Fortescue (2005: 263, 282) reconstructs, furthermore, Proto-Chukotian (PC) stems that 
lack a cognate in Itelmen, i.e. PC *ræmi ‘where’ (Chukchi 7emi, Kerek Xam, and Ko- 
ryak hemmi, Alutor -) and PC *tærər ‘how much’ (Chukchi t'er, Kerek t’aj, Koryak teni, 
and Alutor tarar). Itelmen likewise exhibits interrogatives without clear equivalents in 
Chukotian such as one meaning ‘what’ (Eastern nkc, Southern nakxej, and Western árqa, 
Fortescue 2005: 399). Fortescue (2011: 1372) compares PCK *ðæq- ‘what’ with Nivkh t^a- 
/Fa- (85.2.3) and tentatively reconstructs PCKA “ta(q)-. However, this reconstruction is 
still too speculative, given that the genetic connection between the two families has not 
been proven beyond doubt. This stem in Chukotko-Kamchatkan cannot only have nom- 
inal but also verbal properties. 


(45) Alutor 
yatta taq-atkan? 
2SG.ABS.SG what-1Prv [2sc.S] 


"What are you doing?' (Nagayama 2011: 294) 
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(46) 


(47) 


(48) 


Chukchi earlier made a characteristic difference between req- as used by men and ceq- 
as used by women (Kampfe & Volodin 1995: 8). But this is just the effect of a more general 
pattern in which women pronounced r as c that seems to have been lost by now (Dunn 
2000). Another language in Northeast Asia that makes some distinctions between the 
grammar of questions of women and men is Japanese (§5.6.2). Similar to Ket (§5.13.3), 
interrogatives can be incorporated into the verb. When incorporated the meaning of 


Koryak (Kamenskoye) 

ni-ya’q-iyi? 

HAB-what-2sG 

"What are you doing?’ (Bogoras 1922: 730) 


Koryak 

n-re’q-iyit? 

HAB-what-2sG 

"What are you doing?’ (Bogoras 1922: 730) 


Chukchi 

na-req-iyot? 

HAB-what-2sG 

‘What are you doing?’ (Dunn 1999: 368) 


req-/raq- ^ r’e-/r’a- changes from ‘what’ to ‘why’. 


(49) 


As examples (49a) and (49c) illustrate, the meaning ‘why’ is otherwise expressed with 
the dative form of the interrogative. See §5.8.3 and §5.10.3 for a somewhat similar devel- 


Chukchi 

a. raq-eta  na-wetgawe-g at? 
what-DAT PRs-speak-2sc.S 
"Why do you speak?' 

b. na-raq-a=wetgawe-g’at? 
PRS-Wwhat-E-speak-2sc.S 
"Why do you speak?' 

c. r'a-eta  ata-g at? 
what-DAT come-2sc.S 
"Why did you come?' 

d. r’a=nata-g’at? 
what-come-2sc.S 


"Why did you come?' (Spencer 1995: 457, from Skorik) 


opment in Khorchin and Manchu. 


Interrogatives in Chukotko-Kamchatkan languages have elaborated paradigms (see 
Nagayama 2011: 293f. on Alutor; Bogoras 1922: 726ff. on Koryak; Georg & Volodin 1999: 
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134-136 on Itelmen). In Chukchi the paradigms correspond to the second [+Hum] and 
first declension [+/-Hum] of nouns, respectively (Table 5.11). In order to make clear the 
distinction found in the second declension into collective suffixes on the one hand and 
number/case suffixes on the other, the sign @ indicates which of the markers is absent. 
The layering of suffixes follows the order v-coLL-NUM/CASE. The first declension has no 
collective suffixes. Locative interrogatives and demonstratives have parallel paradigms 
(Dunn 1999: 286f.), e.g. nut-ku ‘DEM.PROX-LOC’, nen-ku ‘DEM.DIST-LOC’, and min-ke ‘where- 
Loc’. The ablative (mer-qo(ra)) and allative (min-kari) have the same forms throughout. 


Table 5.11: Chukchi interrogative paradigms according to Kampfe & Volodin 


(1995: 87) 
Glossing who (2nd decl.) what (1st decl.) 
stem mik- req- 
ABS.SG meni-O-n r'etnyt-Ó 
ABS.PL miky-O-nti r'etnyt-et 
LOC-ERG (-COLL) miky-ne-O req-e (INST-ERG) 
LOC-ERG (+COLL) miky-ryk-@ req-yk (Loc) 
ABL (-COLL) mek-@-gypy, (meky-na-jpy) r'a-/raq-gypy 
ABL (+COLL) meky-r-gypy - 
ALL (-COLL) meky-na-@ (/-gty) raq-ety 
ALL (*COLL) meky-ryk-y ; 
ORIENT (-COLL) miky-O-gjit reqy-gjit 
ORIENT (+COLL) miky-ry-gjit = 
DESIG (+/-COLL) miky-@-ny req-y 


In Alutor, participle forms of the interrogative verb may take case markers as well. 


(50) Alutor 
annin taq-a-l?-u qa  paninal?-u? 
well what-E-PTCP-ABS.PL EMPH ancestor-ABS.PL 


"Well, what did (our) ancestors do?’ (Nagayama 2016: 133) 
Predicatively used interrogatives can also take person and number markers. 


(51) Alutor 
mik-ine-yot yatta ` ununlu-jyot? 
who-POSS-2SG.PRED 2sG.ABS child-2sc.PRED 
"Whose child are you?’ (Nagayama 2016: 121) 


Unlike Chukchi or Itelmen, but similar to Aleut (85.4.3), Alutor and Koryak not only have 
plural but also dual forms. 


127 


5 Survey of the grammars of questions in Northeast Asia 


In sum, Chukotko-Kamchatkan interrogatives deviate strongly from other NEA lan- 
guages. No K-interrogatives are present and only Itelmen k’e has been tentatively classi- 
fied as a KIN-interrogative, although it likely derives from what has been reconstructed 
as PCK “mika. Complex paradigms with sandhi effects, ergative marking, dual num- 
ber (e.g., Koryak ma’ki ‘aBs.sc’, ma’kinti Ans, Du, maku’wyi ‘ABs.PL’, Bogoras 1922), and 
incorporation set Chukotko-Kamchatkan apart from most other languages in NEA. How- 
ever, ambivalent interrogative stems meaning ‘(to do) what’ are shared with Tungusic, 
Eskaleut, and Samoyedic. Especially Itelmen exhibits an opaque interrogative system 
that resists any synchronic attempt for analysis. An exhaustive diachronic analysis can 
only be accomplished by experts on the language. 


5.4 Eskaleut 


5.4.1 Classification of Eskaleut 


The Eskimo-Aleut or Eskaleut language family may be classified as in Figure 5.2 (Berge 
2006; 2010; Fortescue 2013; and especially Fortescue et al. 2010: xiif.). 

Languages spoken in Northeast Asia are signaled with an asterisk, but for the purpose 
of better understanding, Central Alaskan Yupik will be included in the discussion as well. 
For a more fine-grained classification of subdialects see Fortescue et al. (2010: xiif.). The 
primary split is between Aleut on the one hand and Eskimo on the other. Eskimo itself 
falls into two main branches, Yupik and Inuit. However, Sirenik(ski)—usually considered 
a part of Yupik—could possibly form a third branch of Eskimo (Fortescue et al. 2010: x). 
In general, the Aleut branch must be considered the most aberrant member of the family. 
Aleut historically formed a dialect continuum with linguistic diffusion from east to west 
but only three main dialect groups are sufficiently attested (Bergsland 1997: 14). Copper 
Island or Mednyj Aleut is a truly mixed language that contains a large number of Russian 
elements, including verbal morphology (e.g., Comrie 1981: 253; Golovko & Vakhtin 1990; 
Sekerina 1994; Golovko 2003; Vakhtin 1998), but is classified with other Aleut dialects 
here. 


5.4.2 Question marking in Eskaleut 


Eskaleut languages are famous for their interrogative mood, perhaps because of the well- 
known description of questions in West Greenlandic by Sadock (1984), but this is not 
present in all Eskaleut languages. Aleut has a mobile question particle, hi(i)’ ~ ii’ with 
final glottal stop in the Eastern dialect and ii in Attuan and Atkan (Bergsland 1997: 82) 
that marks polar and focus questions. It is reanalyzed here as enclitic because it freely 
attaches to the element in focus. As expected, the finite verb is focal in polar questions. 
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Eastern Aleut* 


Atkan* 
Aleut* 


— * Attu(an)* 


— Copper Island (Mednyj) Aleut* 
r— Naukan(ski) Yupik* 


= — Central Siberian Yupik* 
Yupik 


— Alutiiq Alaskan Yupik 


— Central Alaskan Yupik 


L. 'Sirenik(ski dos 
EN Sirenik(ski) (Yupik?) 


r— Seward Peninsula Inuit 


—— North Alaskan Inuit 


Inuit Western Canadian Inuit 


— Eastern Canadian Inuit 


L Greenlandic Inuit 


Figure 5.2: Classification of Eskaleut. 


(52) Aleut (Atkan) 
a. qilagan — piitra-£ hla- tuga-l  saga-na-&-ii? 
yesterday PN-ABS.sG boy-ABs.sG hit-?cvB AUX-REM-SG=Q 
"Did Peter hit the boy yesterday?' 
b. qilagan piitrax hlag tugal=ii sagana&? 
"Did Peter really hit the boy yesterday?' (Bergsland 1997: 83) 


No Aleut dialect content questions have an overt question marker. The sentence "Who 
are you?’, for example, is kiin ax(t)? in Atkan, kiin txin? in the Eastern dialect, and 
kiin tin? in Attuan, where only the interrogative kiin ^who' marks the sentence as a 
question. In Atkan the interrogative is followed by a second person form of the copula a- 
‘to be’ while in the other two dialects there is an overt second person singular pronoun 
that can be analyzed as t(x)i-n ‘DEM-2sc’ (Bergsland 1997: 57, 81, 89, 135). The marking 
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of questions in Aleut is thus typologically close to the Tungusic language Evenki, for 
example, although alternative questions contain a disjunction. 


(53) Aleut (Atkan) 
ting asxuunulax txin satxa€ tagaagan axs? 
1sG or 2sG gill net check.? be? 


‘Am I or are you going to check the gill net?’ (Bergsland 1997: 83, passim) 


Copper Island Aleut presents a special case because of its strong Russian impact. Un- 
fortunately, almost no information on interrogative constructions is available. As far 
as the few examples allow any conclusions, polar and content questions were probably 
unmarked.’ 


(54) Copper Island Aleut 
a. ya-gim bud-is=min’a su-t’? 
1sG-?REL.SG AUX-2sG-1sG.ACC take-INF 


"Will you take me with you?’ 


b. aqu-x ti ángicikay-usa-i$? 
what-ABs.sc 2sc think-TR-2sG 


"What are you thinking about?' (Golovko & Vakhtin 1990: 103, 104) 


The auxiliary bu(d)-, the personal ending -i$, the infinitive marker -t° as well as the pro- 
nouns ya, min'a (used as a verbal person marker), and ti are of Russian origin. 

The following short dialogue between a five year old child and her mother in Sirenik 
was recorded in 1985. The data show a mixed language that is comparable to Copper 
Island Aleut but might be more strongly based on Russian. The short dialogue includes 
an alternative question and an answer. There is juxtaposition of the two alternatives 
and there is no question marker or disjunction. Presumably, the question had a special 
intonation contour. 


(55) ?Central Siberian Yupik 


a. mam, ya èto quuv.a-yu, ya èto niv.a=y.u? 

mom 1sc this pour.out-1sc 1sc this pour.into-1sc 

‘Mom, shall I pour this out or shall I pour this (into something)?’ 
b. ladno, quuv=a.y. 

alright pour.out=IMP 

‘All right, pour it out? (Vakhtin 1998: 324) 


As the alternative questions appears to consist of two juxtaposed focus questions, we 
may surmise that focus and polar questions were marked by intonation as well. 

Before focusing on Yupik as spoken in Siberia, let us have a brief look at the better- 
known Central Alaskan Yupik language to establish a reference point. Polar questions in 
this language are marked with a second position marker that also marks focus questions. 


3Most elements in these examples are from Russian, except for those underlined, which derive from Eskaleut. 
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(56) Central Alaskan Yupik (General) 
a. tekít-üqz qaa nuk'aq? 
arrive-3SG.IND#Q PN.ABS.SG 
“Has Nuk'aq arrived?’ 
b. nuk’aq#qaa tekit-uq? 
PN.ABS.SGZQ arrive-3sSG.IND 


‘Has Nuk'aq arrived?’ (Miyaoka 2012: 168) 


Polar questions may also be marked with final rising intonation alone. The marker 
qaa has been translated as ‘right’ and may also mark tag questions. It is also optionally 
found on the first alternative in alternative questions and combines with a disjunction. 


(57) Central Alaskan Yupik (General) 
a. enér-pa-rqér-tuten, qaá/qáa? 
house-big-have-2sG.IND Q 
"You have a big house, right?’ 
b. qayar-pa-li-ug#qaa wall’u# pi-cuar-mek? 
kayak-big-make-3sc.IND#Q or# thing-small-ABM.sc 
‘Is he making a big kayak or a small one?’ (Miyaoka 2012: 170) 


Mild questions are marked with the suffix Jr. translated as ‘perhaps’ and topic ques- 
tions (polar or content) with =mi. 


(58) Central Alaskan Yupik (General) 

a. kai-lli-uten? 
hungry-INF-2sG.IND 
‘Are you hungry?’ 

b. kaig-tuten=qaa? 
hungry-2sG.IND-Q 
‘Are you hungry?’ 

c. u-na-mi? 
this-EX.ABs-Q 
‘How about this?’ 

d. nauwa=mi aata-ka? 
where=Q  father-ABs.1sG.sG.POSS 
‘(But) where is my father?’ (Miyaoka 2012: 173, 171) 


Content questions uttered in soliloquy contain an enclitic =kiy ‘I wonder’ that attaches 
to the initial interrogative. 
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(59) Central Alaskan Yupik (General) 
qaillun-kiq tai-ga? 
how=Q come-3sc.Q 
"How did he come over, I wonder?’ (Miyaoka 2012: 1360) 


Whether Central Siberian Yupik or Naukan Yupik share all of these question markers 
remains obscure from the limited and problematic publications available to me. 

As in this last example, and similar to Yukaghiric languages (85.14.2), content ques- 
tions exhibit an additional interrogative mood marking on the verb that replaces declar- 
ative endings. 


(60) Central Alaskan Yupik (General) 
a. nuk’aq ` tekit-uq. 
PN.ABS.SG arrive-3SG.IND 
‘Nuk’aq arrived: 
b. ki-na tekit-a-O? 
Who-ABSs.sG arrive-3Q-3SG.Q 
"Who has arrived?’ (Miyaoka 2012: 167) 


This last type of question marking shows that questions in Yupik are much more com- 
plicated than in Aleut as they combine special interrogative mood suffixes with special 
interrogative person endings. 

Morphological question marking in Yupik involves two layers of suffixes. The first is 
an actual question marker and attaches to the stem (Table 5.12), followed by the second, 
which is an agreement marker of person and number specialized for questions. Regard- 
ing the second layer, there is a distinction between intransitive and transitive paradigms. 
What is more, the first layer exhibits an additional distinction into different forms that 
depends on person as well. There are, furthermore, some complex morphonological pat- 
terns of interactions between the stem and the two layers of suffixes that cannot be dealt 
with here in detail, e.g. CAY niic+tat+yu ‘hear+3Q+3sc.S.3sc.0.Q’ > niitau ‘does (s)he hear 
it? (Miyaoka 2012: 1350). Table 5.12 lists the first layer of question marking. Generally, 
first and second persons are marked the same way, while third person receives another 
marker. 

The morphosyntactic behavior of the forms is quite complex, but has only been de- 
scribed in sufficient detail for Central Alaskan Yupik: “The initial /c/ of the first- and 
second-person mood markers is fricativized to /z/ after a vowel if the subject is singular 
and, if the subject is non-singular, (though with some variance) after a stem that ends in 
a stop plus /i/" (Miyaoka 2012: 1352) 

Intransitive interrogative agreement forms are given in Table 5.13. Apart from Sirenik, 
the individual affixes are very similar across the different languages. 

In Naukan Yupik the form -see might derive from a combination of -si(i) with -a. Note 
a parallel in the transitive paradigm below: -see '2Pr.S.2s6.O' = CAY +ci+na. As we have 
just seen in Table 5.12, CAY -ci corresponds to Naukan Yupik -si(i) (^ -jii). 


132 


5.4 Eskaleut 


Table 5.12: Simplified inventory of interrogative mood endings in Central 
Alaskan Yupik (Miyaoka 2012: 1352), Central Siberian Yupik (Jacobson 1979: 
61), Naukan Yupik (Menovshchikov 1975: 240ff.), and Sirenik (Vakhtin 2000: 


517) 

CAY CSY Naukan Sirenik 
3SG C +ta ~ V +ya +(t)a ~ +(g)a -aa -taa ~ -tsaa 
3PL id. id. id. id. 
3DU id. id. id. id. 
1SG *1ci +(t)zi -si(i) ~ -jii -sii ~ -tsii 
1PL id. +ste id. id. 
1DU id. id. id. id. 
2SG id. +(t)zi id. id. 
2PL id. +ste id. id. 
2DU id. id. id. id. 

Table 5.13: Intransitive interrogative person endings in Central Alaskan Yupik 

(Miyaoka 2012: 1352), Central Siberian Yupik (St. Lawrence Island, Jacobson 

Se 61), Naukan Yupik (Menovshchikov 1975: 240), and Sirenik (Vakhtin 2000: 
S CAY CSY Naukan Sirenik 
3SG +O +O -Ø +a 
3PL +t +t -t +i 
3DU +y +k -k - 
1SG +ga ma -see, -Ø +n 
LPL +ta +a -ta +ta 
1DU +nuy un -nuy - 
2SG +t +n -n +Ø 
2PL +ci +tsi -si +si 
2DU +tiy +k -tyk - 
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Tables 5.14, 5.15, 5.16 contain transitive interrogative endings from Central Alaskan 
Yupik, Central Siberian Yupik, and Naukan Yupik. 


Table 5.14: Transitive interrogative person endings in Central Alaskan Yupik 
(Miyaoka 2012: 1352); forms in parentheses are identical with the intransitive 
forms 


A/O 38G 3PL 3DU 1SG 1PL 1DU 2SG 2PL 2DU 
35G  +yu +ki +kiy +ga +kut +kuy +tin +ci +tiy 
3PL = +tyu +tki +tkiy *tga +tkut +tkuy +tyin +tci  +ttiy 
3DU  -yniyu +yniki +ynikiy +yga +ykut +yk+kuy +ytyin +ytci +yttiy 
ısG (+ga) (+ga) (+ga) +kint ? ? 

ıPL (+ta) (+ta) (+ta) ? ? ? 

1DU (+nuy) (+nuy) (+nuy) ? ? i 

25G +yu +ki +kiy +ga +kut +kuy 

2PL  +ci+yu +ciki +cikiy +ci+ġa +cikut +cikuy 


2DU +tiyu +tiki +tkiy +tiyna +tiykut +tiykuy 


Table 5.15: Transitive interrogative person endings in Central Siberian Yupik 
(St. Lawrence Island, Jacobson 1979: 61, 56); forms in parentheses are identical 
with indicative forms 


A/O  3sc 3PL 3DU 1SG 1PL 1DU 2SG 2PL 2DU 
35G  -gu +ki t+kek +ga *;nkut *;nkur 4ten ` +; Si *istek 
3PL — -tgu +tki +tkek +tya *;nkut *;nkur 4ten — «si +;stek 
3DU  -«gnegu +gneki +gnekek +gnena  -nkut *inkug +ten — tsi +jstek 
isc. — +kun +ngi +gngek -ken (-msi) (-mtek) 
IPL | *ggu +ki +kek -ken (-msi) (-mtek) 
1DU  -gnegu +gneki +gnekek *ken (-msi) (-mtek) 
2SG — *gu +ki +kek +ga +kut +kuy 

2PL | *ggu +ki +kek +yya +kut +kuy 


2DU  -gnegu +gneki -gnekek +gnena +gnekut +gnekuy 


In Sirenik, intransitive first person singular and second person plural forms are iden- 
tical to the declarative endings. Paradigms for transitive verbs are almost entirely un- 
known. Vakhtin (2000: 521) mentions +(gy)pyn’/+(rb1)mprH’ ‘2sc.A.1sc.O’, +n’/+H’ “3PL.A. 
1sc.0’, +kyn/+kprH ‘1sc.A.2sc.0’, +tyn/+TbIH “3sG.A.2sc.0’, and +gu/+ry °2/3sG/1PL.A. 
3sc.O’. Apart from the first, these seem to correspond to Central Alaskan Yupik +(t)na, 
+kin, +tin, and +(t/yni)yu, respectively (Table 5.14). 


^In the original table this form was given one row below, which seems to be a mistake (cf. Miyaoka 2012: 
1350). Miyaoka (2012) is not sufficiently clear about the gaps marked with a question mark here. The other 
languages show indicative forms here that roughly correspond to CAY -mt+yin '1Pr.2sc', -miytin “IDU.2sc’, 
-mci ‘1sG.2PL’, -mtci ‘1PL.2PL’, -miyci '1DU.2PV', -mtiy ‘IsG.2DU’, -mttiy ‘1PL.2DU’, and -miytiy ‘IpU.2DU’ 
(Miyaoka 2012: 1325). 
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Table 5.16: Tentative transitive interrogative person endings in Naukan Yupik 
(based on Menovshchikov 1975: 241f.) 


A/O asc 3PL 3DU 1$G (PL 1DU 2SG 2PL 2DU 
35G Dn -ki -kyk -ņa  -tykut -tykuk -tyn -si -tyk 
3PL  -txug -tyki -tykyk ma -tykut -tykuk -tyn -si -tyk 
3DU ? ? ? ? ? ? ? ? ? 

ısG -ku -nka -xka -kyn -msi -mtyk 
IPL -vut -put  -xput -mtykyn -msi -mtyk 
iDU -vuk -puk -xpuk ? ? ? 

2SG On -ki -kyk -ņa -kut -kuk 

2PL -siûņ -siki -sikyk -see -sikut  -sikuk 

2DU ? ? ? ? ? ? 


Some of the agreement forms in Central Alaskan Yupik (Table 5.14) are still analyz- 
able in two different affixes (stem-A-O). According to this observation, the following 
suffixes can be extracted: -Ø- “3sc.A’, -t- ‘3PL.A’, -y(ni)- ‘3DU.A’, -Ø- ‘2sG.A’, -ci- ‘2PL.A’, 
and -tiy- ‘2pu.A’. These are related to, but not identical with the intransitive markers 
(Table 5.13). With some exceptions, these suffixes are also present in Central Alaskan 
Yupik and Naukan Yupik. In Central Alaskan Yupik “gaps in the paradigm are filled in 
with an intransitive person marker, which is extended to transitive use, without distin- 
guishing the object number” (Miyaoka 2012: 1350). Central Siberian Yupik on the other 
hand has special third person as well as second person singular object forms and em- 
ploys the indicative forms as second person plural and dual object endings. In Central 
Siberian Yupik, the interrogative mood marker (Table 5.12) takes a form with i instead 
of a before the endings with subscript ;. “The final or semi-final vowel of these endings 
if often lengthened (and e changed to a) if the verb is used in a ‘yes’ or ‘no’ question.” 
(Jacobson 1979: 61) Menovshchikov’s (1975: 242) table of interrogative forms on which Ta- 
ble 5.16 was based seems to be rather problematic, as it apparently shows some confusion 
regarding grammatical relations. My analysis usually follows the comparison with Cen- 
tral Alaskan (Table 5.14) and Central Siberian Yupik (Table 5.15) (see also Menovshchikov 
1975: 241). For lack of data, dual A forms have usually been excluded. In some instances 
either intransitive or transitive verb endings may be employed with a slight change of 
meaning. 


(61) Central Alaskan Yupik 


a. ca-mek ner-yug-ci-t? 
what-ABM.SG eat-DES-2SG.Q-2SG.INTR.Q 
"What (kind of food) do you want to eat?’ 
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b. ca ner-yug-ci-u? 
what eat-DES-2sG.Q-2sG.TR.Q 


"What/which (specific) food do you want to eat?' (Miyaoka 2012: 756) 


A difference between Central Alaskan Yupik and Central Siberian Yupik is that in the 
former they are limited to content questions while in the latter they are also encountered 
in polar questions. 


(62) Central Siberian Yupik (St. Lawrence Island) 
a. negh-yug-si-n? 
eat-DES-2SG.Q-2SG.Q 
‘Do you want to eat (anything)? 
b. sa-men negh-yug-si-n? 
what-ABM.SG eat-DES-2SG.Q-2SG.INTR.Q 
‘What (kind of food) do you want to eat?’(Jacobson 1979: 60) 


Table 5.17: Summary of question marking in Eskaleut. 


PQ CO AQ 


Aleut (Atkan)  -ii - asxuunulax ‘or’ 
Mednyj Aleut - - ? 
CAY V#qaa (see Tables 5.12, 5.13, (A#qaa) + wall’u ‘or’ 
5.14) 
CSY (see Tables 5.12, 5.13, id. ? 
5.15) 
Naukan (see Tables 5.12, 5.13, ?id. ? 
5.16) 
Sirenik (see Tables 5.12, 5.13) ?id. ? 


There is a marked contrast between Aleut and Yupik question marking (Table 5.17). 
Aleut resembles the Northeast Asian mainstream, while Yupik belongs to an area in the 
northern part of NEA that exhibits complex interrogative mood systems (e.g., Audova 
1997). Other languages belonging to this belt are Nganasan (§5.12.2), Yukaghiric (§5.14.2), 
and perhaps Negidal (85.10.2). 


5.4.3 Interrogatives in Eskaleut 


The comparison of interrogatives in Yupik languages and Sirenik is relatively straight- 
forward (Table 5.18). The interrogative system in all four languages listed is relatively 
similar, but Sirenik is clearly the most aberrant. All three languages have resonances 
in q- and n-. Apart from CAY, there is an additional resonance in s-. Yupik thus has 
K-interrogatives. The authors also mention the form PE “ay ‘what did you say’ (CAY ai, 
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CSY ay, Naukan Yupik ay), but this is not a true interrogative (Fortescue et al. 2010: 62). 
Other interrogatives such as “cuug ‘why’ or *qanuq ‘how’ can only be found in Inuit 
(Fortescue et al. 2010: 98, 310). Central Siberian Yupik interrogatives were also mostly 
left unexplained in Jacobson’s (2001: 49, 57, passim) description. Fortunately, there is a 
very good analysis for Central Alaskan Yupik by Miyaoka (2012: 443-461) that can be 
transferred to the other languages. 

Similar to Tungusic, Chukotko-Kamchatkan, and Samoyedic the CSY stem sa- ‘(to do) 
what’ (CAY ca-) may take both nominal and verbal morphology and the forms meaning 
‘why’ are derived from its verbal form. CAY has a form ciin ‘why’ that is a contrac- 
tion of ca-nan, a third person singular causal connective mood form and cognate with 
CSY sa-nan. Naukan Yupik si(i)mi has a similar phonological development and seems 
to correspond directly with the third person reflexive form sa-nami in CAY and Sirenik. 
According to Jacobson (2001), sa-nami is a form that requires a third person singuar sub- 
ject while sa-namen (not listed in Table 5.18) is used with third person plural forms. An 
interesting speciality of Yupik and Sirenik is the existence of two forms meaning ‘when’ 
for future and past actions that has no equivalent in NEA. The suffix -ku in qa-ku- is 
a future form, but Miyaoka (2012: 452f.) does not comment on the etymology of gan- 
vay- ‘when (Psr)' (CSY qavgaq, Naukan Yupik qamvaq) but is of the opinion that it also 
derives from the stem qa(r)-. CAY qavci-n ‘how many’ (CSY qafsiin, Naukan Yupik qaf- 
sit) is a plural absolutive form of the stem qavciy- (Sirenik qafsi(y-)). All four languages 
above have KIN-interrogatives, although the stem really is ki(t)-, ki-na being its singular 
absolutive form and kin-kut its plural absolutive form. The selective interrogative naliq 
is apparently an unanalyzable form, and can be inflected, e.g. nallir-put ^which one of 
us’ (cf. Miyaoka 2012: 451). The form natan ‘how’ is restricted to Naukan Yupik, CSY, and 
Sirenik (Fortescue et al. 2010: 223), while CAY has the interrogatives qaillun ~ qaill’ and 
qayu- instead (Miyaoka 2012: 454f.). The special form natan is certainly connected with 
the stem na- that is ambiguous and means both ‘which’ and ‘where’. Table 5.19 compares 
locative interrogative paradigms in CSY and CAY. Demonstratives in Central Alaskan 
Yupik also have an allative ending +vit and variants (Miyaoka 2012: 769). 

Because of a rather unsystematic presentation by Bergsland (1997: 80-83), no complete 
analysis of interrogatives in all the Aleut dialects can be presented here (Table 5.20). 
Copper Island Aleut, in addition to Aleut interrogatives, has borrowed a number of Rus- 
sian interrogatives (Table 5.21). Similar to Eskimo and several other languages in NEA 
(Chapter 6), no other interrogative starts with the same consonant as does kiin ‘who’. 
However, the interrogative system is quite different from Yupik and Sirenik, although 
some similarities can be observed. The personal interrogative kiin ‘who’ (pu kiinkux, PL 
Eastern kiinkun, Atkan kiinkus), for example, is directly comparable. There is one major 
resonance in q-. The stem qana- has the same semantic scope over selective and loca- 
tive meaning as does na- in Yupik and Sirenik. The stem alqu- (Attuan aqu-) ‘(to do/be) 
what, what kind/part, to be how’ is entirely absent from Eskimo, but exhibits the same 
ambiguity between a verbal and nominal stem as does PE *cu- ‘(to do) what’. The causal 
interrogative alqu-l ‘why’ likewise has a verbal basis. 
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Table 5.18: PE = Proto-Eskimo and PY-S = Proto-Yupik-Sirenik interrogatives 
and cognate sets according to Fortescue et al. (2010: 97, 98, 190, 223, 304, 310, 
318); not all variants and dialectal forms are shown 


Reconstructions CAY CSY Naukan Sirenik 
PE *ki(na) kina, kina, kina, kina, 
‘who’ PL kinkut PL kinkut PL kinkut PL kinoki 
PE *kitu kitu- kitu- kitu- - 
‘which, who’ 
PE *qaku qaku qakun qaku qaku 
"when. pur" 
PE *qaga, qanyaq qavgaq qamvaq qay 
qagyaq 
"when. per" 
PE *qavcit qavcin qafsina(t), qafsit qafsi(y-) 
‘how many’ qafsiin ‘which 
inarow 
PE *na- na- na- ‘which’ na- na- 
‘where’ 
PE *nayu, nauxu 3sc naayu, naa ‘where’ 3sc nayola 
*na(C)ur 3DU nakak, 
‘where (is it)’ 3PL naaki 
PE *nallin naliq naliq naliq nacəy 
‘which’ 
PY-S *natə- natə- natə- ‘where,  - natalnuy ‘which’, 
‘which (part)’ natu- ‘which’ natu ‘where’ 
PY-S *natan 5 naton naton naton 
‘how’ 
PY-S “cayan, ciin 3sc saran, si(i)mi sanami 
*caami 3SG.REFL 
‘why’ sarjami, 
1sG sanama 
PE *cu(na) ca- sa- suna, saroca ‘what’, 
‘(to do) what’ sa- sa- ‘to do what’ 


138 


5.4 Eskaleut 


Table 5.19: CSY (Jacobson 2001) and CAY locative interrogatives (Miyaoka 2012) 


Meaning CAY 
where (-1oc) na-ni 
whence (-ABL) na-ken na-ken 
whither (-ALL) na-vek na-mun 


Table 5.20: Atkan Aleut interrogatives according to Bergsland (1997: 80ff.) 


Meaning 


Form 


who 

what (-ABS.sG) 

to do what (-v-) 
which 

where 

why (-cv8) 

how 

how many/much 
when 


kiin, pu kiin-kux, Pr kiin-kus 
alqu-& 

alqu-sa- 

qanan, PL qana-kus 

qanang 

alqu-l 

qanamat-, qanamasix 
qanaang 

qanayaam 


Table 5.21: Interrogatives in Copper Island (Mednyj) Aleut (Sekerina 1994: 26) 
in comparison with Attuan Aleut (Bergsland 1997: 80ff.) and Russian (§5.5.3.3) 


who 

what (-ABS.sG) 
how 

why 

where from 

how many/much 
when 

which 

which 


where 


Mednyj Aleut 
kiin 

aqu-X ~ aqo-& 
aqu-ta-l 
aqo-li, aqu-ma 
qanaaga 
qanan 

ka(g)da 
kakuy 
katoraye 
kuda 


Attuan Aleut Russian 

kiin - 

aqu- - 

aqu-ta- - 

aqu- - 

qanaax - 

qanang - 

- kokda/korja 
- kakoj/xaxoit 
- kotoryj/koropsri 
- kuda/xyza 
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5.5 Indo-European 


5.5.1 Classification of Indo-European 


According to Glottolog (Hammarstróm et al. 2016), Indo-European encompasses 583 lan- 
guages. Similar to §5.9 on Trans-Himalayan, this section can only deal with a minor 
part of the whole Indo-European family. The exact internal phylogenetic structure of 
the family is not absolutely clear (see §2.5), but one may roughly distinguish 10 differ- 
ent branches as well as a couple of unaffiliated and sparsely attested languages that 
are excluded here (Fortson 2010: 10): 1. Albanian, 2. * Anatolian, 3. Armenian, 4. Balto- 
Slavic (Baltic, Slavic), 5. Celtic, 6. Germanic, 7. Greek, 8. Indo-Iranian (Indo-Aryan, Ira- 
nian, and perhaps Nuristani), 9. Italic, and 10. + Tocharian. Only West Germanic (Ger- 
man dialects, Yiddish, English), East Slavic (Russian, Ukrainian), East Iranian (Sogdian, 
Khotanese, Tumshuquese, Sarikoli), and Tocharian (Tocharian A, B, and perhaps C) have 
representatives in NEA. For the mixed Persian-Uyghur language Eynu (dint X 5), spo- 
ken in the southeast of the Tarim basin, see $5.11. Taimyr Pidgin Russian (or Govorka) 
and Chinese Pidgin Russian (sometimes called Kyakhta Pidgin) will be included in this 
chapter, but the mixed Russian-Aleut language Mednyj Aleut has been treated in 85.4 
on Eskaleut. 


5.5.2 Question marking in Indo-European 
5.5.2.1 Question marking in Proto-Indo-European 


PIE presumably had interrogatives in initial position, optionally preceded by a topical- 
ized element (Fortson 2006: 232). Questions in PIE were probably primarily marked with 
a special intonation contour (Delbrück 1900: 259-288; Lehmann 1974: 101f., 120-123, 179f.; 
Hackstein 2013: 99), although word order change is attested in several Indo-European 
branches (Hackstein 2013: 102). Some old Indo-European languages had sentence-initial 
or second position clitics (e.g., Gothic an, =u, Braune & Heidermanns 2004). However, 
the markers in individual branches are not cognates of each other, which is why no such 
marker can be reconstructed. 


5.5.2.2 Question marking in Germanic 


Modern Germanic languages generally have verb-initial word order for marking polar 
questions (63b). In declarative sentences the verb usually takes second position (63a). 
Consider the following constructed German examples as well as their English translation. 
In addition, the German polar question has a rising intonation as opposed to the falling 
intonation in the declarative sentence. 
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(63) German? 


a. Peter hat einen Hund gekauft. 
PN has a.Acc.Mdog bought 


‘Peter bought a dog: 


b. Hat Peter einen Hund gekauft? 
has PN a.Acc.Mdog bought 


‘Did Peter buy a dog?’ 


If no other auxiliary is present, English requires the addition of the auxiliary to do. As 
further explained in Chapter 4, the cross-linguistically untypical phenomenon of word 
order for question marking (Dryer 2013j) originated in the loss of a second position clitic 
such as Gothic -v. Such clitics usually attach to the verb in polar questions and to focused 
elements in focus questions. When the question marker was lost, the verb-initial word 
order took over its function (e.g., Miestamo 2011). Plautdiitsch likewise preserves the 
verb-initial word order. 


(64) Altai Low German 
váitst duu va"t diina fryy feelo deed? 
know.PRs.2sc 2sG what 2sG.GEN.F wife miss.INF do.PRs.3sc 


‘Do you know what problem your wife has?" (Jedig 2014: 170) 


An exception among Germanic languages is Yiddish, which has borrowed the Polish, 
Ukrainian, or Belorussian initial question marker, which will be discussed in §5.5.3.3. 
Nevertheless, there is still a word order change as well as final rising intonation as op- 
posed to the falling declarative intonation. 


(65) Yiddish 
a. mojse hot gekojft a hunt. 
PN has bought a dog 
"Moses bought a dog’ 
b. ci hot mojse gekojft a hunt? 
Q hasrN bought a dog 
‘Did Moses buy a dog?’ (Sadock & Zwicky 1985: 181) 


The German examples in (63) above that were constructed on the basis of these Yiddish 
examples exhibit in addition a slightly different word order in that the participle stands 
sentence-finally. The Yiddish word order is not usually found in German and has an 
archaic flair to it. The initial question marker is optional. 


The glossing in this chapter is somewhat simplified and relies on the relatively close relationship of English 
with other languages. 


Cf. non-standard German (constructed) Weißt du, was deiner Frau fehlen tut? 
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(66) Yiddish 
bist du meshuge? 
be.PRS.IND.2sG 28G crazy 


"Are you crazy?' (Jacobs et al. 1994: 408) 


(67) German 
Bist du verrückt? 
be.PRS.IND.2sG 28G crazy 


‘Are you crazy?" 


Focus questions in German and English have the same structure as polar questions 
but contain an additional intonational nucleus on the focused element. English may also 
make use of a cleft, e.g. Is it to school that you are going? 


(68) German 
Gehst du zur Schule ? 
g0.PRS.28G 2SG to.the.F.sG.DAT school 


"Are you going to school?’ 


I am unaware of any descriptions of focus questions in Yiddish or Plautdiitsch but it is 
probable that they have a pattern similar to German. 

Content questions in German, Plautdiitsch and Yiddish do not have a special marking 
but do have sentence-initial interrogatives. They may be preceded by a conjunction such 
as German und ‘and’. 


(69) German (Colloquial) 
a. Wo  is(t)der Mensch? 
where is  the.w.NOM person 
"Where is the person?’ 


b. Na, und warum? 
well and why 


"Well, why then?’ 


(70) Yiddish 
vu | izder mentsh? 
where is the.M.NOM person 


"Where is the person?' (Jacobs et al. 1994: 408) 


(71) Altai Low German 
na, on vu:rgm? 
well and why 


"Well, why then?' (Jedig 2014: 170) 


7Colloquially, German also has the adjective meschugge ‘crazy’. 
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Yiddish has an optional marker =zhe that may attach to interrogatives and seems to 
intensify the sentence (it was translated as ‘on earth’) (Jacobs et al. 1994: 413). It is of West 
Slavic origin, e.g. Czech -Ze (Sussex & Cubberley 2006: 317). English differs somewhat 
from these three languages in that content questions usually require an auxiliary or to 
do to follow the interrogative. 


(72) English 
a. Where is the person (going)? 
b. Where did he go? 


Alternative questions in German combine usual polar question marking (verb first 
word order and intonation) with the disjunction oder [-e] ‘or’. Negative alternative ques- 
tions have the standard negator nicht that in colloquial speech often takes the form nich. 
English has a similar polar question marking in combination with or. 


(73) German 


a. Magst du Tee oder Kaffee? 
want.PRS.IND.2sG 2sG tea or coffee 


‘Do you want tea or coffee?’ 
b. Magst du Tee oder nicht? 
want.PRS.IND.2SG 2sG tea or not 


‘Do you want tea or not?’ 


Altai Low German has verb first word order in combination with óuda nich ( German 
oder nich(t)) for negative alternative questions and probably óuda (2 German oder [-e]) 
for plain alternative questions (Nieuweboer 1999: 177). Yiddish also has the verb-in initial 
position but exhibits alternation between the use of odar ‘or’ and ci, which has been 
influenced by Slavic (Jacobs 2005: 205). 

German has two different constructions in which the disjunction oder takes sentence- 
final position and acts as a question marker. In the first case it is accompanied by a 
longish and level intonation, in the second with a sharp rise in intonation. The former is 
a fully elliptic alternative question and the latter a tag question. 


(74) German 
a. Magst du Kaffee oder ...? 
want.PRS.IND.2sG 28G coffee or 
‘Do you want coffee or not?’ 


b. Du magst Kaffee, oder? 
2sG want.PRS.IND.2sG coffee or 


"You want coffee, right?’ 
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In the latter type the tag may also take the negative form oder (etwa) nich(t) with an 
optional emphatic marker. German has several more tags such as ja ‘yes’, richtig ‘right’ 
and nicht(t) ‘not’, or dialectal forms such as wa(t) (Standard German was ‘what’) and 
the synchronically unanalyzable form ge(lle). In all cases a tag seems to be accompanied 
with a sharp rising intonation contour. English has related tags such as right but is best- 
known for its tags whose polarity depends on the preceding declarative, e.g. is it vs. isn’t 
it, do you vs. don’t you etc. Plautdiitsch has the tag question markers jau? ~ jo? ‘yes’ 
(German ja), nee? (German nein, colloquially ne(e)), óuda ‘or’ (German oder), and es nich 
zóu? ‘isn’t it’ (German is(t) es nich(t) so?) attached to the end of a declarative sentence 
(Nieuweboer 1999: passim). 

Indirect polar questions require a special marker, English if or whether, Plautdiitsch 
ous, or German ob. Interestingly, English whether historically derives from the PIE in- 
terrogative "k"óteros ‘which of two’ (see §5.5.3.1), German ob and English if show con- 
nections with conditionals, but the etymology of Plautdiitsch ous is not perfectly clear. 
Yiddish has adopted the use of ci in indirect questions from Slavic. 


(75) Altai Low German 
ous ät“ siina fryy ka"n heilo 
if 1sG 3sG.GEN.F wife can.?IND.PRS.1sG cure.INF 
‘whether I can cure his wife’ (Jedig 2014: 170) 


In German an embedded polar question can also stand on its own to form a question 
usually addressed to oneself and roughly meaning ‘I wonder’. 


(76) German 
(Ich frage mich,) obes wohl regnen wird. 
1sG ask.PRS.IND.1sG 1sG.ACC if it perhaps rain.INF will.PRS.IND.3SG 


‘I wonder whether it will rain? 


Indirect content questions have an interrogative instead of the mentioned indirect 
question marker found in polar, focus, or alternative questions. Indirect content ques- 
tions in German and English have the interrogative in initial position, but have a differ- 
ent word order from plain content questions. In German, verbs are strictly final in both 
types of indirect questions. 


(77) German 
(Ich frage mich,) wer das wohl ist? 
1sG ask.PRS.IND.1sG 1sG.Acc who that perhaps is 


‘I wonder who that will be’ 


Indirect content questions may also be used on their own for self questions. Both types 
of indirect questions are almost obligatorily accompanied with the modal marker wohl 
(cognate with English well). 


5The word order of Plautdiitsch is also possible in German but sounds very archaic. A German equivalent 
would be something like the following: ob ich seine Frau heilen kann. 
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5.5.2.3 Question marking in Slavic 


Most Slavic languages have a second position polar question clitic =li (Sussex & Cubber- 
ley 2006: 359). In Russian it is found especially in the written language. It also marks fo- 
cus questions, in which case the focused element instead of the verb has to take sentence- 
initial position. 
(78) Russian 
a. otvétil-li studént na vsé voprósy? 
answer.M.sG.PST-Q student to all questions 
‘Did the student answer all the questions?’ 
b. studént-li otvétil na vsé voprósy? 
student-Q answer.M.sG.PST to all questions 


"Was it the student who answered all the questions?’ (Sussex & Cubberley 
2006: 359) 


Only some languages lack the clitic but have a sentence-initial particle instead, including 
Ukrainian cy/un, Belorussian ct). and Polish czy, which has been borrowed by Yiddish. 


(79) Ukrainian 
cy zdoróvyj ty? 
Q healthy 2sc 
"Are you well?' (Sussex & Cubberley 2006: 359) 


In Ukrainian there is a sharp rise at the end of the sentence in polar questions, or over 
the focused element in focus questions (Shevelov 1993: 978). But this is less pronounced 
if the question is already marked with cy. In focus questions it is also possible to move 
the focused element into sentence-initial position. 


(80) Ukrainian 
a. cyty tam buv? 
Q 2sc there were 
"Were you there?’ 


b. ty buv tam? 
2sc were there 


"Were you there?’ 


c. buv ty tam? 
were 2sc there 


"Were you there?' 


d. tam ty buv? 
there 2sc were 


"Were you there?’ (Shevelov 1993: 978) 
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Interrogatives are usually fronted in both languages but not necessarily so in Russian. 
Content questions remain unmarked in both languages. 


(81) Russian 
čto eto? 
what this 


‘What is this?’ (Comrie 1984: 23) 


(82) Ukrainian 
de ty buv? 
where 2sG were 


"Where were you?' (Shevelov 1993: 979) 


In Russian the intonational nucleus can most often be found on the interrogative itself 
(Comrie 1984: 24). 

Alternative questions in Russian and Ukrainian are quite different from each other. 
Ukrainian uses the polar question marker in between the two alternatives. Given its 
syntactic behavior in polar questions, it may perhaps be said to attach to the beginning 
of the second alternative. In Russian, on the other hand, there is a disjunction ili ‘or’. 


(83) Ukrainian 
ty buv u teatri, éy v muzeji? 
2sc were at theater Q at museum 


"Were you at the theater or in the museum?' (Shevelov 1993: 978) 


(84) Russian 
vy xotite čaj, ili kofe? 
2PL want tea or coffee 


‘Do you want tea or coffee?’ (Comrie 1984: 23) 


In Russian the first alternative takes neutral polar question intonation with a sharp rise 
and immediate less sharp fall on čaj, the second alternative has falling intonation similar 
to an interrogative in content questions. Ukrainian negative alternative questions take 
Eu ny/uu HH ‘Q NEG’ (Pugh & Press 1999: 285). In Russian the form ili net/uxu ner is 
used (elicited in June 2017). 

Russian uses question tags less frequently than English or German. But one possibility 
is to attach ne pravda li ‘NEG truth o to a declarative sentence (Comrie 1984: 32). Russian 
furthermore has the question marker razve that may stand sentence-initially and less 
frequently sentence-finally or be adjacent to the focus. Comrie (1984: 21f.) describes the 
use of razve as follows: “the questioner had a certain prior expectation; some piece of 
new information leads the questioner to believe that his prior expectation may be wrong; 
therefore he asks the appropriate general question with razve”. 
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(85) Russian 
razve ty uezzaes? 
Q 2sc leave 


"Are you leaving?' (Comrie 1984: 22) 


Intonation is again similar to plain polar questions. Topic questions are introduced with 
the conjunction a ‘and’, e.g. a viktor? ‘what about Victor?’ (Comrie 1984: 27f.). 

Questions in Chinese Pidgin Russian seem to be generally unmarked. Interrogatives 
remain in situ. Interestingly, the Chinese A-not-A pattern is also possible, e.g. pravda ne 
pravda? ‘true NEG true’ (Shapiro 2010: 39), cf. Mandarin dui-bu-dui?. 


(86) Chinese Pidgin Russian 


a. za vashe  zh'onusheki mes'aca posidi esa? 
TOP 2sG.GEN wife.?PL together sit ^ REL 
‘Do you sit together with your wives?’ ? 

b. ni-dy | $yma mür.mur? 
2sG-GEN what say 


"What are you saying?’ (Shapiro 2010: 37, 15) 1° 


In Taimyr Pidgin Russian polar questions may also be unmarked. But perhaps there is 
a special intonation contour in both pidgins. 


(87) Taimyr Pidgin Russian 
tebja urusé-to jest? 
2sG rifle-HL Ex 


‘Do you have a rifle?’ (Stern 2005: 312) 


The suffix -to glossed as “highlighter” usually has a discourse function and is of northern 
Russian origin (see Stern 2005: 309; 2012: 439). Content questions are unmarked and 
interrogatives are either sentence-initial or preverbal (Stern 2012: 508). 


(88) Taimyr Pidgin 
čego tebja nado menja Cum mesto? 
what 2sG DEON 1sG tent place 


‘What do you want in my tent? (Stern 2012: 361) 


The polyfunctional case marker mesto, from a noun meaning ‘place’, is an innovation 
of the pidgin (Stern 2012: 360-382). There is an instance of an open alternative question 
that combines a disjunction, an interrogative, and a (polar) question marker in that order. 


?The topic marker za stems from Mongolic (Shapiro 2010: 35). 
10Cf, Mandarin ni-de VRHI '2s6-GEN', shénme {t+ ‘what’. 
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(89) Taimyr Pidgin Russian 
ty govorit, mama amadjane puskat budem, ili éego=li? 
2sG say.3sG mother hither NEG let.INF AUX.FUT or what=Q 


‘Are you saying that we should not let mother come here or what?’ (Stern 2005: 
310) 


The use of a disjunction may be traced to Russian influence, but the presence of the 
Russian polar question marker after an interrogative might be local influence. Compare 
Nganasan open alternative questions in which the second part takes a similar form, e.g. 
maa-nu- ‘what-Aor.Q-’ (§5.12.2). However, similar phenomena are also known from Rus- 
sian. For example, the following sentence was recorded in the Allaikhovsky district in 
the north of the Sakha Republic: A, zdec’ čto=li? ‘So here or what?’ (Krasovitsky 2004) 


5.5.2.4 Question marking in Iranian 


Polar questions in the extinct Iranian language Saka are unmarked except for, perhaps, 
intonation. Negative alternative questions are marked with a disjunction aa that later de- 
veloped into o, followed by the negator ne. Content questions seem to have remained un- 
marked and interrogatives were fronted (Emmerick 2009: 402). Optionally a “discourse 
initiator” (e.g., tta ‘thus, so’) could precede an interrogative (Emmerick 2009: 403), which 
is typologically similar to initial Mandarin nd JJ ‘that’ or English so. 


(90) Saka (Tumshugese) 
uvasanu samvaru paitryai patandya? 
laywoman.GEN.PL regulations.acc.sG can.2sG.PRS.IND keep.INF 


‘Can you keep the regulations of the laywomen?’ (Emmerick 1985: 10f., 14) 


(91) Saka (Khotanese) 
cuude bremá? 
why  weep.2sG.IND 


"Why are you weeping?' (Emmerick 2009: 402) 


Sogdian has an optional sentence-initial polar question marker (a)éu-t(i) what-cowr' 
(Yoshida 2009: 316f.). Negative alternative question seem to have the same marker at 
the beginning of the whole sentence in combination with a marker kataar(-ati) ‘which(- 
COMP)' between the two alternatives. Sogdian thus has two question markers that de- 
rive from interrogatives. Sogdian kataar, like English whether, derives from PIE *k"ótero- 
"which of two' (85.5.3). Content questions have no special marking. Rhetorical questions 
have in addition a marker pnuukar. Interrogatives remain in situ. 


(92) Sogdian 


a. ə.ču.ti pnuukar tawa | waanoo nee patyoosti? 
Q Q.RHET by.youthus NEG heard.PRET 


‘Have you never heard this?’ 
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b. éu.tixa zaakt  tadeed ayatant kataar.ati nee? 
Q the children there came o NEG 


‘Have the children arrived at your place or not?’ 


c. tayu peernamstar Cu əktya k0aare? 
2sG before what deed do.PRET.2SG 


"What was it that you did before?' (Yoshida 2009: 317) 


There are initial question markers in Persian (aayaa) and Tajik (oyo) as well, but these 
show no connection with interrogatives (Windfuhr & Perry 2009: 438). Tajik may also 
employ the Uzbek sentence-final marker =mi (Windfuhr & Perry 2009: 481, §5.11.2). 

For Sarikoli there is a relatively old description by Shaw (1876: 29). According to him, 
polar questions in Sarikoli have a sentence-final marker â, while content questions re- 
main unmarked. This marker, according to Gao Erqiang (1985), has the form o and has 
been reanalyzed here as an enclitic. 


(93) Sarikoli 
bofa-af tag wand-o? 
PN=2PL actually see.psT=Q 


‘Have (you) actually seen Bosha?’ (Gao Erqiang 1985: 62) 


The same marker appears twice in alternative questions. The following example contains 
in addition an element naji in between the two alternatives that was glossed as a negator 
but seems to have the function of a disjunction here. 


(94) Sarikoli 
mal tuyur  xyor-an-o naji wi budo yor-an-o? 
1PL chicken eat-1PL-Q NEG that beef eat-1PL=Q 
‘Do we eat chicken or do we eat that beef?’ (Gao Erqiang 1985: 65) 


According to Gao Erqiang (1985: 118), Wakhi has a disjunction jo and question mark- 
ing on the first alternative only (=a), which is a construction similar to surrounding 
languages such as Uyghur ($5.11.2). In Sarikoli there are several tag question markers 
that contain the same question marker, e.g. na sou-d-o? ‘NEG be.possible-3sc-Q' (Gao 
Erqiang 1985: 90). 


(95) Sarikoli 
tudsik ziv ati wazon-d, rust-o? 
PN language? know.N.psT-3sc true=Q 


‘(She) knows Sarikoli, right?’ (Gao Erqiang 1985: 89) 


The question marker seems to be connected with Burushaski, e.g. bás-a? ‘Is it enough?’, 
and several surrounding languages (Yoshioka 2012: 190). Consider an example from the 
Dardic language Palula spoken in the extreme north of Pakistan where the marker, de- 
pending on the dialect, takes the form =aa or =ee. 
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(96) Palula (Dardic, Indo-European) 
bud-u=ee? 
understand.PFv-MSG=Q 
‘Did you understand?’ (Liljegren 2016: 403) 


Content questions in Sarikoli are unmarked and interrogatives seem to remain in situ. 


(97) Sarikoli 
mur vrud tar ko? 
1sG.GEN brother Ex where 


"Where is my brother?' (Gao Erqiang 1985: 86) 


5.5.2.5 Question marking in Tocharian 


Tocharian has unmarked polar questions (98), but perhaps had a special intonation con- 
tour that cannot be reconstructed. 


(98) Tocharian B 
ate kampal yamasasta? 
away coat.ACC do.PRET.2sG 


"Have you put (your) coat away?’ (Hackstein 2013: 110) 


In Tocharian A there are two optional question markers, second position =te and 
sentence-final assi that may be found together in one sentence. 


(99) Tocharian A 
ynálek-te lo — kálk assi? 
elsewhere-Q away go.PRT.3sc Q 


‘Has (s)he gone somewhere else?’ (Hackstein 2013: 111) 


According to Hackstein (2004: 175) assi derives from PIE “hzet + *k"ih, (cf. Latin atqui), of 
which the latter part is an instrumental form of an interrogative that is also the source of, 
for example, Polish czy (cf. Latin qui ‘how’). The first part “hzet, or "buet with one of the 
laryngeals hz or h; according to Mallory & Adams (2006: 289ff.), is a preposition meaning 
‘away, beyond’ (e.g., Tocharian B at(e) ‘away’). Hackstein assumes that aśśi started out 
as a question tag similar to German wie ‘how’ or was ‘what’. Content questions may be 
unmarked in both Tocharian B (e.g., Adams 2013: 157) and Tocharian A, e.g. kus tam? 
‘Who is that?’ (Carling 2009: 156). The particle assi can also be encountered in content 
questions and sometimes fuses with interrogatives, e.g. tà, tassi ‘where’ (Sieg & Siegling 
1931: 182). In some instances the last consonant of the interrogative is lengthened, e.g. 
kus, kuss assi ‘who, what’ (Sieg & Siegling 1931: 190). In Tocharian A some interrogatives 
are also often followed by pat (nu), the exact function of which reamains unclear to me, 
e.g. kus pat nu 'or what now' (Carling 2009: 156). In alternative questions one marker 
appears on each alternative in Tocharian B. 
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(100) Tocharian B 
pandkte=wat yopsa, nande=wat? 
PN.NOM-Q  enter.PRET.38G PN.NOM=Q 


‘Has Buddha or Nanda (just) entered?’ (Hackstein 2013: 110) 


Interestingly, Hackstein (2013: 111) also has an example of a negative alternative ques- 
tion that has a disjunction, Tocharian B epe mà? ‘or NEG’. The same disjunction can 
optionally also be found in plain alternative questions (Hackstein 2013: 111). Tocharian 
A has negative alternative questions with the marker -te used once on each alternative. 
In the second alternative it attaches to the negator, mā=te. In one such example there 
is an additional element na in the second alternative, glossed as a question marker by 
Hackstein. 


(101) Tocharian A 
campdl=te nasan cesám wrasassi waste müskatsi, 
be.able.GER2.NOM=Q COP.PRs.1sc this.GEN.PL being.GEN.PL refuge be.INF 
ma=te cämpäl (na) sam? 
NEG-Q be.able.GER2.NOM (Q) COP.PRS.1SG 


‘Am I able to provide refuge to the beings or am I not able?’ (Hackstein 2013: 113) 
The double marking strategy optionally combines with the disjunction epe ‘or’. 


(102) Tocharian A 
ma=(t)e nātäk cam bra(mam) e(pe) ma=(t)e was? 
NEG=Q master this.ACC.SG PN or NEG=Q_ 1PL.ACC 


"Will the master not keep this Brahman or will he not keep us?’ (Hackstein 2013: 
113) 


5.5.2.6 Summary 


Question marking in Indo-European is very different from the majority of languages in 
NEA. As expected for a family with such a long history, the marking of questions varies 
strongly from language to language. Even within the relatively shallow Slavic branch 
there are marked differences. Generally there is a tendency for initial particles or second 
position clitics and disjunctions. To the best of my knowledge, almost all Indo-European 
languages included here have unmarked content questions. Interestingly, at least four 
languages (German wie, was, Ukrainian cy (hence Yiddish ci), Sogdian (2)cu-t(i), kataar(- 
ati), and Tocharian A assi) show a development from interrogative to polar question 
marker and/or question tag, which is quite unusual for NEA (but see §5.12.2 on Selkup). 
This is also known from other Indo-European languages, such as Sanskrit kad ‘what’ 
(Hackstein 2013: 100), Bengali ki ‘what’ (Thompson 2012, see 84.2.1), or Palula ga ‘what’ 
(Liljegren 2016, 84.2.1, 84.2.3). 
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Table 5.22: Summary of question marking in Indo-European languages 


PQ co AQ 
English #V + (to do) (to do) #V + (to do) + or 
German #V - #V + oder ‘or’ 
Plautdiitsch #V - ?#V + óuda ‘or’ 
Yiddish #ci V - #V + odor ‘or’, #V + X ci Y 
Ukrainian #čy - Xéy Y 
Russian -, #V=li - ili 'or 
- - ?ili ‘or’ + =li 
Taimyr Pidgin 
Russian 
CPR - - ? 
Sogdian #(a)éu.t(i) - #(a)¢u.t(i) + X kataar(-ati) Y 
Khotanese - - aa (> 0) ‘or’ 
Tumshugese - - o ‘or 
Sarikoli =o# - 2x=o# + ?naji ‘NEG’ 
Tocharian A #A=te, ?aśśi# -, ?a$$i 2x =te (+ epe ‘or’) 


Tocharian B - - 2x =wat, epe ‘or’ 


5.5.3 Interrogatives in Indo-European 
5.5.3.1 Interrogatives in Proto-Indo-European 


A somewhat outdated, but nevertheless useful, typological classification of Indo-Euro- 
pean is in so-called centum and satam languages. The designation follows the Latin and 
Avestan words for ‘hundred’, respectively, that represent the two types. The two groups 
are divided by their reflexes of Proto-Indo-European velar, palatal and labiovelar conso- 
nants. In centum languages the palatals and in satam languages the labiovelars became 
plain velars (Table 5.23). PIE *Kmtom ‘hundred’ starts with a palatal and thus remained 
a palatal in Iranian (and later changed to s in Avestan) but became a plain velar written 
as «c» in Latin (cf. Fortson 2010: 146). The languages in this study belong to both the 
centum (Germanic, Tocharian) and satam (Iranian, Slavic) types. 

This division is important for the purposes of this study, because PIE interrogatives 
usually began with the labiovelar "k" that was preserved in Germanic and Tocharian, but 
changed to plain velars in (Indo-)Iranian and (Balto-)Slavic. In Tocharian the labiovelars 
were later mostly lost in favor of plain velars. However, in some instances they show 
reflexes, e.g. Tocharian A kus, B kuse ‘who’. In Germanic "Kk" regularly changed to "br 
e.g. Old English hwā or Old High German (h)wer ‘who’. German later entirely lost the 
initial consonant, e.g. German wer ‘who’. In English the development is more compli- 
cated. The modern spelling preserves the original «hw» with metathesis as «wh», but 
the pronunciation varies between /h/ (who) and /w/ (what). In Slavic as well as Iranian, 
plain velars were palatalized before front vowels such as i but otherwise remained stable, 
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e.g. Russian kto ‘who’ (PIE *k"o- + *tod) but čto ‘what’ (PIE *k"i/e- + *tod), or Avestan 
kas(a)- etc. ‘who’ (PIE *k"ó-s) but cis ‘who’ (PIE *k"í-s) (e.g., Fortson 2010: 231£., 421). 


Table 5.23: Developments of PIE velars (Fortson 2010: 58) 


centum < PIE > satom 
*k "E *K 

*k *k 
kw * k” 


PIE interrogatives can be reconstructed as *k"o-, *k"e-, *k"i-, and *k"u- (e.g., Cysouw 
& Hackstein 2011), but a controversy concerns the status of the last of them. Dunkel 
(2014: 436-441) argues that it must be reconstructed as *kú- ‘where’ and thus does not 
belong to the group of interrogatives starting with *k"-. According to him, the forms 
beginning with *k"- were actually derived from "ko in the first place (Dunkel 2014: 436). 
In his analysis, *k”i- and *k"é- are combinations of *kú- with anaphoric stems *i- and 
*e-, while the formation of *k"ó- is not solved entirely (see Dunkel 2014: 478). Whether 
this hypothesis is correct cannot be decided here, but it seems possible. If it is accurate, 
interrogatives in Indo-European are ultimately based on locative interrogatives as in 
German (see §5.5.3.2). 

Table 5.24 gives the list of interrogatives in PIE as reconstructed by Mallory & Adams 
(2006). A more extensive discussion of interrogatives can be found in Dunkel (2014: 453- 
479). It cannot be given here in its entirety, although some of his reconstructions have 
been integrated into Table 5.24. In some cases there is a corresponding demonstrative 
with the same endings, e.g. "to-deh, ‘then’, “to-r ‘there’, *to-ti ‘so much/many’ etc. Ac- 
cording to the authors, *k”om ‘when’ is a masculine accusative form of "k"os ‘who’, 
which seems extremely unlikely from a semantic point of view. 

Proto-Indo-European thus had K-interrogatives but no KIN-interrogative (because of 
a missing nasal). Perhaps, the relative *yo- goes back to an interrogative as well but is not 
attested as such (Mallory & Adams 2006: 421). There may have been one interrogative 
stem that did not start with *k"- but *m-. It has been reconstructed as *me/o- ‘who, 
which' (Mallory & Adams 2006: 421; Cysouw & Hackstein 2011; Dunkel 2014: 518-523), 
e.g. Tocharian A mánt ‘how’. 


5.5.3.2 Interrogatives in Germanic 


Table 5.25 gives the diachrony of several Germanic interrogatives and their modern Ger- 
man and English cognates. For some additional discussion see also Dunkel (2014). PIE 
*k"otero- ‘which of two’ has lost its interrogative meaning in German weder ‘neither’ 
and in English whether, used for indirect polar, focus, and alternative questions. 
German, Yiddish, and Plautdiitsch share a single resonance in v~, as German <w> is 
pronounced as [v] as well (Table 5.26). As mentioned before, English has a variation 
between w- and two forms starting with h-. Altai Low German vandie ‘when’ is closer 
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Table 5.24: Selected PIE interrogatives with some cognates according to Mal- 
lory & Adams (2006: 419f.); extended with the help of Dunkel (2014: 436-441, 


453-479); accents partly removed 


PIE form PIE meaning Selected Cognates 

*k"ós who NOM.SG.M NE who, Grk toü, Skt kas, Got was 

*k”ís who NOM.SG.AN Lat quis, Grk tis, Av ciš 

*k"ód what NoM/Acc.sG.N Lat quod, NE what, Skt kad 

"kd what NoM/acc.sG.N Lat quid 

*k"óm ?when ACC.sG.M Got han, OCS ko-gda, (Lat cum) 

*k"íh; how INST Lat qui, AE hwi, Polish czy 

*k"óteros which (of two) Grk poteros, Skt katara-, OCS koteryji 
*k"odéh4 when Skt kada 

*k"or where Lat quor, OHG hwar, Skt karhi 

*k”óti how much/many Lat quot, Grk posos, Skt kati 

"kéen how much/many Av aiti, Breton pet der ‘how many days’ 
*k"ehjk- what kind of [North West] OCS kakü, Lithuanian kok(i)s 
*Kk"oli ?how much only OCS kolikü ‘how large’, koli ‘how much’ 
*k“u(ú), ?*kú where Lat ubi, Grk pu-, Skt kū 

*k"u-dhe where OCS küde, OAv kuda, Skt kuha 


to Dutch wanneer than to German wann. Similar to Dutch waar and English where it 
also retains a reflex of a final r in vuue, while German only preserves an older form wor- 
in derived forms. But the form vou- directly corresponds to German wo-. Also compare 
German worauf, Dutch waarop, and Plautdiitsch vourgp ‘on what’ as well as German 
was, Dutch wat, and Plautdiitsch vaut ‘what’. Yiddish far vos has direct correspondences 
in German für was and English what for. This is a common European formation, e.g. 
Italian perché ‘why’ (cf. per ‘for’, che what). 

German exhibits an interesting congruence of the two forms was ‘what’ and wie ‘how’ 
that in certain circumstances are mutually exchangeable. 


(103) German 


a. Wie/Was ist dein Name? 
how/what is your.M.sc name 


‘What is your name?’ 


b. Bist du fertig oder wie/was? 
are 2sc ready or how/what 


‘Are you ready or what?’ 
c. Du bist der Neue, 


wie/was? 


you are the.m.sc new.one how/what 


"You are the new one, aren't you?’ 
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Table 5.25: Diachrony of selected German and English interrogatives (Hack- 
stein 2004: 175; Seebold 2002; Mallory & Adams 2006: 419f.; and Kroonen 2013: 


261, 264) 
PIE PG Old Germanic Modern Germanic 
"k"os M *h"az Got hvas - 
"k"isM *hViz OHG (h)wer NHG wer 
*k"eh»? F "bo OE hwa NE who 
?OE ha NE how 
*k"od N "baart Got hat, OHG (h)waz, OE NHG was, NE what 
hweet 
*"k"otero- “h”abera Got lwabar, OHG (h)wedar, (NHG weder ‘neither’, NE whether) 
OE hwaeóer 
*k"or *h"ar Got hvar, OHG (h)war, OE NHG wo, NE where 
hwer 
*k”om ? Got wan - 
? OHG (h)wanne, (h)wenne, NHG wann, (wenn ‘if’), NE when 
wenno, OE hwanne 
*k”ihı OE hwi, hwy NE why 
? Got lvaiwa, OHG (h)wio NHG wie ‘how’ 
Table 5.26: English, German (own knowledge), Yiddish (Katz 1987: 197; Jacobs 
et al. 1994: 404, 413-414, passim), and Altai Low German interrogatives (Jedig 
2014: passim); Plautdiitsch forms in square brackets from Nieuweboer (1999) 
English German Yiddish Plautdiitsch 
who [h-] wer [-e:e] ver veee 
how [h-] wie [-i:] vi [vóu] 
how much/many ` wieviel(e) [-i:-] vi fil(e) [vóu fiel] 
what was vos vaut 
which welch-er [-e] velkher [vóune-] 
what kind of was für [-ye] ein- [a-] vos far a 
where wo vu vuue 
wither, where to wohin vuhin vuuehaan 
whence, where woher [-e:e] vuuzheee, vouheee 
from 
when wann ven vonáie 
why, how come, wieso, weshalb, warum vuurom 


for what reason 


what for 


wozu, für [-ve] was 


tsu vos, far vos 
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d. Wie/(Was) du bist schwanger? 


how/what you are pregnant 


"You are pregnant?’ 


English cannot employ the interrogative how in these circumstances. The information 
on Yiddish and Plautdiitsch available to me is insufficient for a comparison. 

German was für ein- is a complex interrogative similar to English what kind of. Inter- 
estingly it is still separable as witnessed by the following examples. 


(104) German 


a. Was für ein Urlaub ist das? 
what.kind holiday is that 
b. Was ist das für ein Urlaub? 
what is that kind ` holiday 


"What kind of holiday is this?' 
An analogous situation can be seen in Yiddish. 


(105) Yiddish 


a. vos far a yontev iz dos? 
what.kind holiday is that 


b. vos izdos far a yontev? 
what is that kind holiday 
‘What kind of holiday is this?’ (Jacobs et al. 1994: 413) 


For Altai Low German no cognate is attested (but cf. Dutch wat voor een). The conju- 
gation of was fiir ein- in German is highly complex and depends on number, gender, and 
case (Table 5.27). In the plural the interrogative welch- ‘which’ substitutes for ein- (cf. 
eins ‘one’). Compare the full paradigm of the interrogative welch- ‘which’ (Table 5.28). 
The genitive forms are rare, but are listed for the sake of completeness. 


Table 5.27: Conjugation of was fiir ein- ‘what kind of’ 


Case SG.M SG.F SG.N PL 

NOM ein-e ein ein(s) @/welch-e 
ACC ein-e ein-en ein @/welch-e 
DAT ein-er ein-em ein-em @/welch-en 
GEN ein-es ein-er ein-es @/welch-er 


Both was fiir ein-/welch- as well as welch- may be used either pronominally or attribu- 
tively. If used attributively and in the plural, was fiir may be used on its own. In the 
singular there is the purely pronominal form was fiir eins for the neuter instead of the 
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Table 5.28: Conjugation of welch- ‘which (one)’ 


Case SG.M SG.F SG.N PL 
NOM welch-e welch-er welch-es welch-e 
ACC welch-e welch-en welch-es welch-e 
DAT welch-er welch-em welch-em welch-en 
GEN welch-es welch-er welch-es welch-er 


attibutive form was für ein. German wie viel-, Yiddish vi fil(e), and Plautdiitsch vóu fiel 
are based on the same underlying pattern as English how many. The conjugation of wie 
viel- exhibits the same case markers as the plural forms of welch-. While English employs 
how much instead of how many for mass nouns, German wie viel simply lacks inflection. 

In German, Plautdiitsch, Yiddish, and English the personal interrogative shows a small 
paradigm. The interrogatives meaning ‘what’ do not show case marking. 


Table 5.29: German, Yiddish, Plautdiitsch, and English conjugation of the per- 
sonal interrogative 


Case German Plautdiitsch Yiddish English 
NOM wer [-e:e] veee ver who [h-] 
ACC wen [-e:-] ? vemen whom 
DAT wem [-e:-] veem vemen whom 
GEN wessen ? vemens whose 


Of these four languages only German and perhaps Plautdiitsch preserve four distinct 
forms, although German wessen, which as has an archaic variant wes, is increasingly 
replaced with von wem ‘of whom’. German has a parallel paradigm and asymmetry of 
the definite article or demonstrative der ‘that one, the.m.sa’: der, den, dem, des(sen), but 
das ‘that’. 

Plautdiitsch vuurgm is comparable to German warum ‘why’, which is based on MHG 
war + umbe ‘where + around’. Several more forms in Plautdiitsch such as vou-bii (German 
wo-bei) have a locative basis. Unfortunately, only a few forms from Plautdiitsch are at- 
tested, which is why German forms are given instead (Table 5.30). English shares some 
of these formations, e.g. whereby, thereby, hereby etc. In one group the r is preserved 
but reanalyzed as belonging to the second element (wor-um » wo-rum, while in another 
group the r was lost or at least is not present. This seems also to hold for Plautdiitsch, 
e.g. vour.op (German wor.auf) and vou-fon (German wo-von). Within the first group the 
vowel following the r helped preserve it. Depending on the verb, some of these forms 
derived from ’where’ may also just mean ‘what’, which is highly unusual from a typolog- 
ical perspective (Cysouw 2007). Compare, for instance, English to consist [of what] and 
German [wor.aus] bestehen. The development of the meaning of the individual forms is 
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Table 5.30: German interrogative and demonstrative paradigms 


where there here Explanation 
wo(r) da(r) hin hier her plain 
wo-bei da-bei - hier-bei - “at, by, with’ 
wo-mit da-mit - hier-mit - ‘with’ 
wo-nach da-nach - hier-nach - ‘to, after’ 
wo-von da-von - hier-von - ‘from, of’ 
wo-zwischen da-zwischen- ?hier-zwischen- ‘between’ 
wo-hin da-hin - hier-hin - ‘there’ 
wo-her da-her - hier-her - ‘here’ 
wo-vor da-vor - ?hier-vor - ‘in front’ 
wo-durch da-durch - hier-durch - ‘through’ 
wo-zu da-zu - hier-zu - ‘to’ 

wor.in d(a)r.in - ?hier.in - ‘in’ 

wor.auf d(a)rauf ^ (hi)n.auf hier.auf (he)r.auf ‘on, up’ 
wor.unter d(a)runter (hi)nunter  hier.unter Uer unter ‘under’ 
wor.iiber d(a)rüber  (hi)nüber hier.über (he)Lüber ‘over’ 
Wor.aus d(a)r.aus ^ (hi)n.aus hier.aus (he)r.aus ‘out’ 
wor.ein d(a)r.ein (hi)n.ein hier.ein (he)r.ein “in(to)’ 
worum,warum  d(arum hin.um hier.um (he)r.um ‘about, in order to’ 


highly idiosyncratic. For example, wor.über may either mean ‘over what place’ but also 
‘about what’. The close relationship between da and wo (English there and where) may 
be directly traced to Proto-Indo-European where we find the two forms *tó-r and *k"ó-r 
(Mallory & Adams 2006: 419). German hier [-i:-] (English here, Plautdiitsch hie, Dutch 
hier) must be a Germanic innovation ultimately based on "buet. ‘this (one)' (Mallory & 
Adams 2006: 417f.), but it is somewhat obscure (e.g., Seebold 2002). 

There are also some parallel forms based on her ‘here (movement)' (a variant of hier) 
and hin ‘there (movement), towards’ that are also used as preverbs, e.g. her-kommen ‘to 
come here’, hin.zu-fiigen ‘to add’ etc. In German the reanalysis resulted in a few problems 
such as the fact that there are no separate forms "he- (hence ber um > he.rum > rum), *hi- 
(hence hin.ein > hi.nein > nein) etc. 


5.5.3.3 Interrogatives in Slavic 


Table 5.31 shows the development of selected Slavic interrogatives over the course of 
time. 

Russian kotóryj is a direct cognate of English whether, and či of why. According to 
Derksen (2008: 172, 227), the second part of Russian kogdá is a dative form of PS *gódo 
‘right time’ and goes back to PIE *g’od"-o- (English good goes back to PIE *g^od"-o-), 
based on the stem PIE *g"ed^- ‘join, fit together’ (Mallory & Adams 2006: 381). According 
to Derksen (2008), the final -li in PS *koli ‘how much’ is the Slavic question marker, but 
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Table 5.31: Diachrony of Slavic interrogatives with selected cognates according 
to Derksen (2008)); PS = Proto-Slavic, OCS = Old Church Slavonic 


PIE PS OCS Russian 

*k"o-ko- *kak'e ‘what (kind of)’ kaks kakój 

*k"o-ter-o- *koterb, *kotorb kotorbi kotoryj 

*k"o- *kejs ‘who, what, which’ ksi koj 

*k"o- + *tod *ksto ‘who’ kpto kto 

*k”o- + *ghodh-o- — "kogmda, *kogsdo ‘when’ kogda kogdá 

*k"oli *koli /how much' koli (kóli ‘if’) 
*koliko ‘how much’ koliko (Cz arch. koliko) 

*k"i/e- + “tod *čpto ‘what’ Crto čto 

"kant *či ‘CONS’ či ‘because’ (dial. či ‘if, or’) 

? “ake ‘such as’ jaks (Cz jaký ‘which’) 


more likely the form simply goes back to PIE *k"oli and is perhaps related to PIE *k"ehąli, 
whence Latin qualis (Mallory & Adams 2006: 420). It would be unexpected for a further 
derivational element to attach to the question marker as in PS “koliko. 

Russian as spoken in Inner Mongolia does not exhibit major differences with respect 
to Standard Russian. Some dialectal forms from Siberia can be found in Table 5.36 below. 
In some instances such as kto vs. xto ‘who’ or kak vs. jak ‘how’ only phonological differ- 
ences separate Russian and Ukrainian. In other cases there are different derivations such 
as in po-Gemt (DAT) vs. pó-áco (NOM) ‘why’. Only in a few instances are there altogether 
different interrogatives such as kogdá vs. koli ‘when’. The Russian and Ukrainian forms 
meaning ‘why’ are case forms (DAT, INSTR, GEN, NOM) used with or without prepositions. 
A preposition can also be found in Russian ot-káda/orkyza ‘whence’ (< OCS ot- < PIE 
haet ‘away, beyond’, Mallory & Adams 2006: 289ff.). Table 5.33 shows the paradigms 
of the interrogative pronouns meaning ‘who’ and ‘what’ in Proto-Slavic, Russian, and 
Ukrainian. Russian cto/aro what has the colloqial pronounciations [fto] and an informal 
variant co/uo [tfjo] (e.g., Bai Ping 2011). 

The difference between the genitive on the one hand and the nominative on the other, 
when filling in for the accusative, is connected with the distinction between animate 
and inanimate meaning (Cubberley 2002: 127). A difference between nominative (e.g. 
Ukrainian x-to ‘who, $-co ‘what’) and oblique stems (e.g., Ukrainian k-, &-) is also known 
from Iranian (see below). Similar to German, the selective interrogative shows extensive 
paradigms. Russian and Ukrainian also have a distinction between masculine, feminine, 
and neuter gender but preserve more cases. For reasons of space only some Ukrainian 
interrogative paradigms will be given in the following (Tables 5.34, 5.35). 

As would be expected, most interrogatives in the two pidgin languages are derived 
from Russian. Table 5.36 shows those interrogatives attested for Taimyr Pidgin Russian. 
An interesting fact is the frequent use of the oblique forms kogo and Gego There are 
three newly formed complex interrogatives. Mednyj Aleut also has some Russian inter- 
rogatives (85.4.3). Apparently, one Nganasan form was borrowed as well. 


159 


5 Survey of the grammars of questions in Northeast Asia 


Table 5.32: Selected interrogatives from Russian (Wade 2011: passim), Russian 
as spoken in Inner Mongolia (Bai Ping 2011: passim), and Ukrainian (Pugh & 


Press 1999: passim) 


Meaning Russian Russian (China) Ukrainian 

who kto/xro kto/xro xto/xTo 

whose éej/ueü tfjej/ueit éyj/auü 

what éto/uTo fto/uro ~ tfjo/uo ` &éo/uo, 
vySCo/sBunto 

who, which kotóryj/koropsri kotryj/korpuA 

whither kudá/xyza ku da/kyza kudy/kynnu 

what (kind of), which ^ kakój/kakoü ka koj/kakoit jakyj/akuá 

how, in what manner kak/xax kak/kak jak/aK 

when kogdá/xorza kag da/korya koli/konu 

where gde/rae gdje/rze deine 

how much/many sk6l’ko/ckombKo 'skolka/ckompxo ` skíl'ky/ckimsku 

whence ot-küda/orkyza zvídky/asinku, 
zvidkil /3Binkib, 
(z)vidkilja/sBigkina 

why po-éemü/noueMy,  za-tfjem/sauedM | éComü/uowy, 

Za-Cém/aaueM Cohó/uoro, 


pó-3co/riomio, 
navy-&co/nanurio 


Table 5.33: Proto-Slavic, Russian, and Ukrainian interrogative paradigms (Pugh 
& Press 1999: 178; Shevelov 1993: 961; Sussex & Cubberley 2006: 269f.; Bai Ping 


2011: 74, 78) 
PS Russian Ukrainian 
Case who what who what who what 
NOM “*k-b-to  “é-b-to k-to é-to [fto] x-to ščo 
GEN *k-ogo *č-eso/-Þ»so k-ogó [kavo] č-egó [tftvo]  k-ohó č-ohó 
DAT *k-omu *č-emu k-omú č-emú k-omú č-omú 
ACC - NOM = NOM = GEN - NOM = GEN - NOM 
INSTR “k-émb *č-imb k-em Cem k-ym Cum 
LOC *k-omp  "é-emb k-om ¢-ém k-ómu, č-ómu, 
k-im č-im 
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Table 5.34: Conjugation of Ukrainian kotoryj ‘which’ (Pugh & Press 1999: 180) 


Case M N F PL 

NOM kotryj kotré kotra kotri 

GEN kotróho id. kotróji kotryx 
DAT kotrómu id. kotrij kotrym 
ACC = NOM/GEN = NOM kotrü = NOM/GEN 
INSTR kotrym id. kotróju kotrymy 
LOC - DAT, kotrím id. = DAT - GEN 


Table 5.35: Conjugation of Ukrainian jakyj ‘what kind of’ (Pugh & Press 1999: 


180) 
Case M N F PL 
NOM jakyj jaké jaká jakí 
GEN jakóho id. jakóji jakyx 
DAT jakómu id. jakij jakym 
ACC = NOM/GEN = NOM jaku = NOM/GEN 
INSTR jakym id. jakóju jakymy 
LOC - DAT, jakím id. = DAT - GEN 


Table 5.36: Taimyr Pidgin interrogatives (Stern 2005; 2012: 435ff., 498); some 


variants were excluded 


Form Meaning Analysis 

syly/cyny who Nganasan sili ‘who’ 

kto/xro who - Russian 

kakoj/kakoit which - Russian 

gde/rne where - Russian 

kuda/Kyzia whither - Russian 

zaCem/3aueM why - Russian 

kak/xak how - Russian 

kogda/koxya when = Russian 

kogo/koro who Russian GEN/ACC 

Cego/uero what Russian GEN 
kudy-mera/xyzbr Mepa where Russian mera/Mepa ‘measure’ 
kudy-mesto/ky I Mecro where Russian mesto/Mecro ‘place’ 
kakoj storona/kakoii cropoHa ` whither Russian storona/cropona ‘side’ 
akto/axro who - dialectal Russian 
pocto/mouro why = dialectal Russian 

čo/uo what, why = dialectal Russian 
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The phenomenon of one form meaning ‘what’ and ‘why’ is also known from the Ira- 
nian languages Sogdian ((a)éu) and Khotanese (cu), see §5.5.3.4. 

Only a short list of interrogatives in Chinese Pidgin Russian can be assembled from 
the material provided by Shapiro (2010). Most forms are of Russian origin, but at least 
one is from Chinese. The interrogative mnogo-malo entirely consists of Russian material 
but follows the Chinese structure and meaning (Table 5.37). 


Table 5.37: Chinese Pidgin Russian interrogatives mentioned by Shapiro (2010); 
the form in < > was given in Chinese Pinyin 


Form Meaning Analysis 

kaka why = Russian kak/kak 

kakoj which = Russian kakoj/kakort 
<gedao’erli> which = Russian kotóryj/koropsrit 
pocheto why - dialectal Russian 


mnogo-malo how much Russian mnógo/Mnoro ‘much’, málo/wazo ‘little’, cf. 
Chinese duosháo 47> 
$yma what = Chinese shénme {tA 


5.5.3.4. Interrogatives in Iranian 


Most interrogatives in those Iranian languages included here are synchronically opaque 
and their etymologies too complex to be given here in their entirety. But consider the 
interrogatives in Table 5.38. As can be seen, Sogdian interrogatives have some direct 
correspondences in, or at least similarities to, Yaghnobi, the only closely related modern 


Table 5.38: Sogdian interrogatives (Yoshida 2009: passim) in comparison with 
Yaghnobi (Geiger 1901: passim; Bielmeier 1989: 482, 484) 


Meaning Sogdian Yaghnobi 
Geiger Bielmeier 
when kaóa kad kad 
which kataam kaam kom, opt, komi 
kuum, opt, kuumi 
where (a)kuu kuu 
who (a)ke, opt (a)kya  kax,oBrkài kax, opt. kay 
what (a)éu éaa €o, OBL Coy 
why (a)éu čuu 
how many/much čaaf(ar) čaaf čof 
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language spoken in Tajikistan. Most up-to-date material on Yaghnobi was published in 
Tajik and has thus to be excluded. 

Some of these forms can directly be traced back to Indo-European. For instance, Yagh- 
nobi kad ‘when’ goes back to PIE *k"odéh; and kuu ‘where’ to PIE *k”u(ú) (or *ku-). The 
occasional initial vowel in Sogdian is perhaps prothetic (Yoshida 2009: 286). Khotanese 
has several comparable forms such as kaama- ‘which’, ku ‘where’, kye ~ ce ‘who’, cu 
‘what, why’, and some additional forms such as craama- ‘what kind of’, ciiya ‘when’, 
caalsto ‘whither’, or canda, cändäka, cerá ‘how much/many’ (Emmerick 2009: 387, 389, 
passim). Table 5.39 lists interrogatives from Sarikoli. In order to put them into a proper 
context, interrogatives from the closely related language Wakhi are listed as well. Re- 
member that these two languages are collectively called Tajik in China but that Tajik is 
really a variety of Persian. 


Table 5.39: Selection of Sarikoli, Wakhi (Gao Erqiang 1985: passim), Tajik, and 
Persian interrogatives (Windfuhr & Perry 2009: 438); Sarikoli form in square 
brackets from Xiren Kuerban & Alimujiang Xiren (2015: 88, 120, 162f.); Wakhi 
forms in square brackets from Bashir (2009: 831); not all variants are listed 


Meaning Sarikoli Wakhi Tajik Persian” 
which tfidum [kum(d)] kadom kodaam 
where ko, ku-dzui kum-dzai ku, ku-jo ku, ko-jaa 
when [tfum] key kay 

who tfoi, OBL -tfi- kui kii ki 

what tseiz, [tseiz] tsiz, [Ciz] — cii ~ ča- ée(-) 

why [tsarang ‘how’ ] [čir] čaro čeraa 
how many/much tsund, [tsond, tfand] [tsum] čand čand 


According to Xiren Kuerban & Alimujiang Xiren (2015: 163), Sarikoli tsond ‘how many/ 
much’ stands opposed to tfand ‘how many’, which may have been adopted from Tajik. 
The locative interrogatives contain words meaning ‘place’. All Iranian languages in- 
cluded here preserve the split between the two resonances k~ and ¢~ (or c-), the latter 
of which goes back to *k- as well. Sarikoli, apart from this distinction, has an innovative 
third type t/~ < k- that cannot be found in Wakhi or Persian. In Gao Erqiang (1985) it 
thus shows synchronic variation between k-, ts-, and t/-. Khotanese similarly had free 
variation between kye and the more innovative ce ‘who’ (Emmerick 2009: 387). Sarikoli 
tfoj ‘who’ has a special oblique form tei that is the basis for case marking (cf. Sogdian 
and Yaghnobi in Table 5.38). 


(106) Sarikoli 


a. tfoj a-ta Oud? 
who.NOM Acc-2sc.Acc hit.PsT 
"Who hit you? 


HSeveral Persian interrogatives have been borrowed by Moghol (85.8.3). 
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b. taw=at a-tei Oud? 
2SG.NOM=2SG.PST ACC-who.oBL hit.PsT 
‘Whom did you hit?’ 

c. taw=at tei=ri Oud? 


2sG.NOM-2SG.PST who.OBL-DAT hit.PsT 
"Whom did you give (it) to?' (Kim 2014: 9) 


Gao Erqiang (1985: 35) has the full paradigm as follows: Now Hoi, GEN tfi-an, DAT tfi-ri, 
ACC a-tfi. 
5.5.3.5 Interrogatives in Tocharian 


Despite the fact that the two Tocharian varieties are thought to be relatively closely 
related, there are nevertheless many differences within the system of interrogatives (Ta- 
ble 5.40). 


Table 5.40: Selection of Tocharian interrogatives according to Adams (2013) 
with additional Tocharian A data by Sieg & Siegling (1931: 176-191) and Carling 


(2009) 

TA TB 
who, what kus kuse 
why k,yal kà 
where tà k,ta-mem 
how, when, if kupre kwri, kryi 
how much/long kos kos, kot 
who, which, what kind of  ántsam, antsam intsu 
where (antan(n)ene conj) inte, ente 
when (antane CONJ) inte, ente 
who, what, which maksu 
how ma(n)t mákte 


This might be additional evidence for Peyrot's (2010: 144) assumption that a “Proto- 
Tocharian may have differed more from its daughter languages than is often suggested 
by superficial similarities between them", which could be the result of later convergence. 
The best etymologies for Tocharian interrogatives have been given by Adams (2013), but 
these are too complex and somewhat too uncertain to be given here in full length. 

There is a resonance in k(,,)~ that , as seen before, is a reflex of PIE *k"-. There are 
also interrogatives starting with m- such as TA mdnt that might be based on an in inter- 
rogative stem PIE * me/o- (Mallory & Adams 2006: 421). However, Adams (2013) assumes 
that TB máksu ‘who, what, which’ and mákte ‘how’ as a middle part contain the ac- 
tual PIE interrogative stem "k"i/u-, preceded by the PIE particle "men and followed by 
different demonstratives or relatives. TB mdksu ‘who, what, which’ exhibits a more or 
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less full paradigm based on person, number, and gender, e.g. máksu *NoM.sc.w', mäksā, 
‘NOM.SG.F’, müktu 'NoM.sG.N' (Krause & Thomas 1960: 166). There is some agreement that 
TA kus and TB kuse < Proto-Tocharian *k"2sé ‘who’ are similarly combinations of an in- 
terrogative with a demonstrative, perhaps PIE "kt + “so (e.g., Kim 2012: 38). This is rem- 
iniscent of Slavic “koto ‘who’ < PIE "k"o- + “tod (see Table 5.33 above). Indo-European 
had a distinction between three demonstratives, *so ‘that one, he’, *seh, ‘that one, she’, 
and tod ‘that one, it’ (Mallory & Adams 2006: 417). The difference lies in the fact that 
Tocharian *k” asé contains the first of these, and Slavic *koto the last. TB k,se ‘who’ (TA 
kus) and k,ce ‘whom’ (TA kuc) later had the abbreviated forms se and ce, respectively 
(Kim 2012: 38), which is reminiscent of TA ta ‘where’ as opposed to TB k,ta-. In Tochar- 
ian B the meaning of both k,se and máksu encompasses both ‘who’ and ‘what’, which, 
apart from Baltic languages and Kusunda, is quite exceptional in Eurasia. 


5.6 Japonic 


5.6.1 Classification of Japonic 


As is by now well established, Japanese is not an isolated language, as was, for in- 
stance, claimed by Shibatani (1990: 89). Instead, Japanese is merely the major, but by no 
means the only, representative of a language family called Japanese-Ryükyüan or sim- 
ply Japonic (e.g., Tranter 2012b: 3f.). A simplified classification of Japonic languages may 
tentatively be represented as in Figure 5.3 (based on Pellard 2009: 264; Chien Yuehchen 
& Sanada Shinji 2010; Shimoji 2010; Hasegawa 2015: 21ff.), excluding most historically at- 
tested and possible Para-Japonic languages. Only those Ryükyüan languages or dialects 
mentioned during this section are listed. 

The primary split in Japonic is between Japanese and Ryükyüan. Mainland Japanese 
constitutes a dialect continuum that can roughly be classified into four larger areas called 
Eastern, Central, Western, and Kyüshü (Hasegawa 2015: 21f.). The Hachijo dialects and 
the Okinawan dialect influenced by Ryükyüan form separate groups in themselves. For 
reasons of space and lack of sufficient information, a focus will be on Modern Stan- 
dard Japanese in this study.” A special case is Yilan Creole spoken on Taiwan and has 
thus also been listed separately. Even though the lexicon is mostly based on Japanese, 
Yilan is a creole language that also exhibits certain influences from Austronesian lan- 
guages, especially Atayal (Chien Yuehchen & Sanada Shinji 2010). Ryükyüan languages 
spoken in the Ryükyü archipelago may be classified into two main branches, Northern 
and Southern Ryükyüan, each of which splits into two branches (Shimoji 2010). South- 
ern Ryükyüan has also been called Sakishima (Bentley 20082). Yonaguni is treated as a 
separate branch of Ryükyüan by Izuyama (2012) and as a separate subbranch of Southern 
Ryükyüan by Bentley (20082: 242), but is often included within the Macro-Yaeyama sub- 
branch of Southern Ryükyüan, the other branch of which is Miyako. Northern Ryükyüan 
can be divided into Amami and Okinawan. There is a large amount of variation among 
Ryükyüan. According to Lawrence (2012: 380), there are 35 “dialects” within Miyako and 


12A Handbook of Japanese Dialects has been announced by De Gruyter for 2019. 
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Figure 5.3: Classification of Japonic 
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20 within Yaeyama alone. Of course, a classification into languages and dialects is diffi- 
cult and even somewhat spurious. But clearly the Ryükyüan Islands can be considered 
a treasure trove of linguistic diversity, of which only some parts can be included in this 
chapter. As is common practice today in the study of Ryükyüan, the place name will also 
indicate the language spoken at that place, i.e. Irabu on Irabu island etc. 


5.6.2 Question marking in Japonic 


Tranter & Kizu (2012: 295) give a good summary of question marking strategies in mod- 
ern Standard Japanese. 


Questions of all types including wh-questions and yes/no questions are expressed 
by a change in intonation and the addition of a particle at the end of the sentence: 
familiar-style -no, -, or polite-style -ka. Soliloquy-type questions that do not nec- 
essarily require a response from a hearer use -kana(a) or -kashira (female). There 
is no change in word order, and no fronting in wh-word questions. Questions that 
present alternatives, including those that ask a question in an affirmative form 
with a negative alternative of the same situation, have the structure of two sepa- 
rate questions. 


The speech level differences are not as strongly developed as they are in Koreanic ques- 
tions (85.7.2). The default and polite question marker in Japanese is the sentence-final 
and possibly enclitic particle ka 7^. The marker can be found in polar, alternative, and 
content questions. 


(107) Japanese 


a. dareka ga  kimashita ka? 
someone NOM come.PST.POL Q 


‘Did someone come here?’ (Hasegawa 2015: 104) 


b. sono hon wa omoshiroi desu ka tsumaranai desu ka? 
that book Top interesting cop Q boring COP Q 


‘Is that book interesting or boring?’ (Hinds 1984: 159f.) 


c. o-namae wa nan desu ka? 
HON-name TOP what coP Q 


"What is (your) name?' (own knowledge) 
The same marker also appears at the end of what seem to be focus questions. The follow- 
ing two examples were elicited from a native speaker living in Germany in November 
2015. The glossing follows Hasegawa (2015). 
(108) Japanese 


a. ashita wa gakko ni iki-masu ka? 
tomorrow TOP school to go-NPST.POL Q 


‘Is it tomorrow that you are going to school?’ 
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b. ashita, gakko ni wa iki-masu ka? 
tomorrow school to TOP go-NPsT.POL Q 


‘Is it to school that you are going to tomorrow?’ 


Similar to Korean (85.7.2) and Wutun (85.9.2.1), it apparently is the topic marker wa that 
follows the focused or perhaps rather topicalized element while the sentence otherwise is 
identical to a plain polar question. Japanese has a special way of forming topic questions 
that contain the same topic marker wa but have a truncated form. 


(109) Japanese 
ano, kyodai wa? 
uh siblings Top 
'So, do you have any brothers or sisters?' (Hinds 1984: 166) 


In Old Japanese, the particle ka already existed but differed from the modern Japanese 
one in its syntactic behavior. According to Vovin (2009: 1220), it was present in both 
Eastern and Western Old Japanese and has the same scope as in modern Japanese. But, 
in contrast to the strict sentence-final position today, the particle could appear in other 
positions as well. Apparently, the particle also marked focus questions and attached to 
the focused element in both focus and content questions. 


(110) Old Japanese 
a. Suz 


ta-ka ta-sar-e? 
who=Q EMP??-go.away-EV 
"Who goes away?' 
b. GRAB 2st Ka 
KO YOPI-ka KYIMYI-NKA WA-Nkari K-YI-[i]mas-am-u? 


this night-o lord-Poss 1sG-DIR è come-INF-HON-TENT-ATTR 
‘Is it tonight that (my) lord will come to me?’ (Western; Vovin 2009: 1220, 
1225) 


Typologically, this is a change similar to the one observed from Middle Mongol to mod- 
ern Mongolian (see §5.8.2). 

In Eastern Old Japanese =ka is attested as a marker for polar, focus, and content ques- 
tions and triggers kakari musubi ‘focus concord’ (see further below): It “forces the main 
verb to take an attributive suffix, regardless of whether it follows or precedes the verb” 
(Kupchik 2011: 834). 


(111) Old Japanese (Eastern) 


a. RRMRRHS 
aNtó-ka a-Nka | se-m-u? 
what=Q 1sG-POSS do-TENT-ATTR 
"What should I do?’ 
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b. WAH f SUR D 
opuse-tamap-o-ka? 
assign.INF-HON-ATTR- Q 
‘Has (the emperor) given (me) the order?’ 
C. Zoos, 
yama-ni=ka ne-m-u mwo / yaNtór-i pa na-si-ni? 
mountain-Loc-Q sleep-TENT-ATTR FOC /lodge-N TOP not.exist-FIN-LOC 


"Shall (I) sleep in the mountains since there is no lodging (here)?' (Kupchik 
2011: 834, 835) 


According to Vovin (2009: 1229), the particle has a cognate in Ryükyüan languages and 
can be traced back to Proto-Japonic. 


The interrogative particle ka ~ ga (« *-N ka) is well attested in both Old Ryükyüan 
and modern dialects. However, as far asI can tell, Ryukyuan ka - ga appears exclu- 
sively in wh-questions [CQ]. Thus, in all probability, WOJ [Western Old Japanese] 
ka in general questions [PQ] represents a Japanese innovation, and we should re- 
construct PJ [Proto-Japonic] *ka, interrogative particle in wh-questions. 


An example from Old Ryükyüan is the following: 


(112) Old Ryükyüan 
WEDOALESLE 
keo wa no-ka s-i-yor-asiyo? 
today Top what-Q do-INr-exist-sUP 
‘What would (they) do today?’ (Vovin 2009: 1229) 


Old Korean had a similar marker -ka & that might be somehow related to the Japonic 
form (85.7.2). But as we will see later in some Ryükyüan languages, there is also the 
possibility that the marker is the result of a language internal development from a focus 
marker. 

Japanese exhibits an instance of the grammaticalization from nominalization to ques- 
tion marker through ellipsis of the following copula and original question marker. 


(113) Japanese 
doko e  ikuno (desu ka)? 
where ALL go N>Q COP Q 


‘Where are you going?’ (Hinds 1984: 163) 


The suffix no originally may have been the genitive case marker (Shibatani 1990: 258). 
See 85.1.2 on Ainuic and below on Ryükyüan for similar developments from nominalizer 
to question marker that may be due to contact with Japanese. According to Hasegawa 
(2015: 297) no adds “various nuances, typically softening the locution when addressing 
an interlocutor. It is, therefore, considered mildly feminine even though male speakers 
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also use this particle.” The most aberrant Japanese dialect, Hachijo, has a marker kai that 
was written attached to a preceding word or with a hyphen and translated with Japanese 
ka 7^. Presumably, it is either a particle or an enclitic. Content questions seem to remain 
unmarked (Kokuritsu Kokugo Kenkyüjo 1950: 130, 208). The Tsuruoka dialect in northern 
Honshü marks polar questions with ga and content questions with na (Matsumori & 
Takuichiro 2012: 323, 325). In the Ei dialect in southern Kyüshü, both polar and content 
questions take the marker ka or its formal variant kana (Matsumori & Takuichiro 2012: 
342). 

Yilan Creole has the two optional sentence-final markers ga and no, corresponding 
to Japanese ka and no, respectively. As opposed to Japanese ka, ga apparently does not 
appear in content questions, which remain unmarked. This may be due to influence from 
Atayal or Chinese. Polar questions generally have a rising intonation. 


(114) Yilan Creole 


a. korega nani? 
this Tor what 


"What (is) this?’ 
b. anta kyolai om ga? 
2sc sibling have Q 


'(Do) you have siblings?' 


c. antano hoyin ga qalux no? 
2sG GEN dog top black Q 


'(Is) your dog black?'? 


d. anta teykan 'suw? 
2sG chair heavy 


'(Is) your chair heavy?’ (Peng Qiu 2015: 52, 54, 55) 


Yilan Creole questions thus behave very similarly to those in Japanese but have a slightly 
different form and semantic scope. 

The last question marker mentioned by Tranter & Kizu (2012: 295) as quoted above is 
-ka.na(a) or -ka.shira, formerly used in women's speech, employed for questions to one- 
self. According to Hasegawa (2015: 294) ka.shira "expresses uncertainty and curiosity" 
and has been translated as ‘I wonder’. As we will see below, Ryükyüan languages have 
similar markers containing an element -ka- - -ga- that was translated in the same way. 

Both Eastern and Western Old Japanese had another question marker ya found in 
polar and focus questions. Its behavior in these two dialect groups is rather similar, but 
there are minor differences. For Eastern Old Japanese we have the following description 
by Kupchik (2011: 832): 


BThe words hoyin ‘dog’ and qalux ‘black’ have been borrowed from Atayal. 
“The words teykan ‘chair’ and ’suw ‘heavy’ derive from Atayal. 
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When in the sentence-final position, it follows the copula tö, the defective verb tö 
‘think; or the evidential form of the verb. The examples with the evidential are used 
to make ironic questions [...]. When this particle is fronted to a pre-verbal position, 
the verb form must take the attributive suffix [...]. Unlike WOJ, where ya is amply 
attested directly after the final form of a verb or the final exclamative -umo (Vovin 
2009: 1211), such usages are unattested in EOJ. 


In Western Old Japanese, the non-final position - presumably found in focus questions 
- also accompanies the attributive form of the verb. In case it is sentence-final - in polar 
questions - it may follow final, evidential, and exclamative forms, but not attributive 
ones (Vovin 2009: 1211). 


(115) Old Japanese (Eastern) 

a. FRORE/RAAFESR 
uwe | kwo-na-N-kyi / Nka-ku kwopiy-m-u tó-ya? 
sow.INF dim-water-Loc-leeks / be.thus-1nF long-for-tent-FIN think/say=Q 
‘Do (you) think (I) love the sowed water leeks so much?’ 

b. BARA FI th / MNFRS oe 
asu-yuri=ya / kaye-Nka muta ne-m-u? 
tomorrow-ABL-Q / reed-Poss together.with sleep-TENT-ATTR 
‘From tomorrow shall (I) sleep together with the reeds?’ (Kupchik 2011: 832) 


(116) Old Japanese (Western) 

a. BRP GST 
na pa kyik-as-u-ya? 
2SG TOP ask-HON-FIN=Q 
‘Shall (I) ask you?’ 

b. HIR ARIE 
kyimyi pa=ya na-kyi? 
lord TOP=Q NEG-ATTR 
‘Don’t (you) have a lord?’ (Vovin 2009: 1211, 1215) 


Similarly to the particle ka, Vovin (2009: 1219) assumes that ya has cognates in Ryükyüan 
and that it can be traced back to Proto-Japonic. 


The cognates ya - yaa of the Western Old Japanese interrogative particle ya are 
well attested in modern Ryukyuan dialects, although in most dialects ya ~ yaa have 
the function of a confirmation seeker, like MdJ ne, and not an interrogative particle. 
As far as I can tell, ya ~ yaa occurs only in sentence-final position. 


But according to Shinzato (2015: 305), the Old Japanese marker rather corresponds 
to the Ryükyüan question marker (y)i, on which see below. Whether Ainu ya may be 
compared remains an open question, but it may well have been borrowed from older 
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stages of Japanese (§5.1.2). A sentence-final particle ya in Standard Japanese is usually 
accompanied by falling intonation and does not express questions (Hasegawa 2015: 298). 

In Standard Japanese there is another sentence-final particle kke, the function of 
which Hayashi (2010a: 2687) explains as follows: 


Thus, unlike ka and no, kke makes implicit reference to knowledge or informa- 
tion previously held by the speaker and shared with the addressee, but which the 
speaker has somehow forgotten or is unsure about. The particle then serves to 
enlist collaborative participation of the addressee in the process of regaining that 
knowledge/information. 


(117) Japanese 
are ichi-nen deshita kke:? 
EXCL one-year COP.PST.POL Q 


"Wait, is (your visa valid) for one year?' (Hayashi 2010a: 2687, simplified) 


There is also a special marker tte for echo questions, which is a variant of the quota- 
tive marker to used in casual speech. But to cannot function as a sentence-final particle 
(Hasegawa 2015: 310f.). 


(118) Japanese 
dare deshita tte? 
who COP.PST.POL QUOT>Q 


"Who did you say it was?’ (Hinds 1984: 165) 


According to Hinds (1984: 165), the marker has its origin in an ellipsis of the subsequent 
speech act verb followed by the question marker ka. 

In a comparative study of question-response sequences in ten different languages, 
Japanese had the highest ratio of polar questions (85%), as opposed to content questions 
(15%). There was only one alternative question. But 39% percent of the polar questions 
had a declarative form and 30% were actually tag questions (Hayashi 2010a: 2686). There 
were three different tag question markers, janai, desho, and yo ne. The first is a negative 
copula ja-nai ‘cop-NEG’ and can roughly be translated as ‘isn’t it?’. It has the shorter 
version jan and a more polite variant janai desu ka. The tag marker desho and its less 
polite variant daro are actually so-called conjectural copula forms meaning ‘probably 
be’ (Hasegawa 2015: 80) and “ask for the addressee’s confirmation to the speaker’s con- 
jecture” (Hayashi 2010a: 2689). They roughly correspond to English tag questions such 
as ‘is it?’. The last form yo ne is a combination of two different markers the function of 
which goes well beyond the marking of questions (see Hasegawa 2015: 299ff.). According 
to Hayashi (2010a: 2690), “these particles are used sentence-finally to make an assertion 
while seeking confirmation/agreement to it from the addressee.’ In combination, yo ne 
was translated as 'don't you think?' But ne can also mark questions on its own. It has a 
variant na that is usually used by men. 
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(119) Japanese 
ii tenki da na? 
good weather cop Q 
‘It is a fine day, isn’t it?’ (Hasegawa 2015: 296) 


Whether this might be a cognate of a question marker found in several Ryükyüan lan- 
guages remains unclear to me. 

The marking of questions in Ryükyüan languages is less well described than for Jap- 
anese. In general, there are similar patterns with sentence-final particles, but in some 
languages there are question suffixes and the pattern of question marking may be quite 
complex. In Ura (spoken on Amami Oshima) polar questions, for instance, there is either 
a rising intonation or a simple sentence-final clitic =na ~ =nja. 


(120) Ura 
kuri-ja hon-na? 
this=ToP book=Q? 


‘Is this a book?’ (Shigeno 2010: 27) 


There is an additional marker for “self-questions”, the semantic scope of which was 
not given. It might belong to other forms meaning ‘I wonder’, e.g. Japanese -ka.na. 


(121) Ura 
an ?cju=ja taru-kai? 
that person-ToP who-Q 


"Who is that person?' (Shigeno 2010: 27) 


Shigeno (2010) does not further specify whether content questions receive a special 
marking or not, but among his examples there are the markers -joo (in CQ), -kana (in 
CQ), and =ja(a) (in PQ), that were glossed as question markers but not further explained. 


(122) Ura 

a. wan-ga kak-ju-O-n=ja? 
1sG=NOM write-IPFV-(NPST)-ADN=Q 
‘Should I write?’ 

b. an ?cju=nkja=ja icu-raga kuma=nan ur-i=joo? 
that person=APPR=TOP when=ABL here=Loc1 exist-NPST-Q 
"What time did those people get here?’ 

c. nan-cjuu-no-kana? 
what=QUOT=GEN=Q 


‘How should (I) express this?’ (Shigeno 2010: 20, 23, 30) 


The enclitic =ja(a) is formally identical to the topic and persuasion markers. 
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A much better description can be found for the closely related language Yuwan (also 
spoken on Amami Oshima). In this variety, the marking of questions is much more com- 
plicated and displays a typologically very interesting pattern. Similar to Ura, polar ques- 
tions are either expressed with rising intonation or an enclitic =na. 


(123) Yuwan 
uro-o0  koow-an=na? 
2sG- TOP buy-NEG=Q 


‘Don’t you buy it?’ (Niinaga 2015: 337) 


But questions in Yuwan may also be expressed by means of affixes. The information is 
insufficient to decide about the distribution of these three different marking strategies. 


(124) Yuwan 
uro-o koo-ju-mi? 
2sG-TOP buy-IPFV=Q 
‘Do you buy it?’ (Niinaga 2015: 337) 


Altogether, there are the three suffixes, - mi for polar questions, -u for content ques- 
tions, and -ui for focus questions. If that is not enough, the latter two suffixes are not 
used in isolation but obligatorily combine with focus markers that are specific to the 
question types, i.e. -ga in content and =du in focus questions. 


The clitic =du cannot appear with -u, while =ga cannot appear with -ui. This kind 
of phenomenon, where the presence of a focus clitic correlates with the type of 
verbal inflection, is known as kakari musubi in Japanese linguistics (Niinaga 2010: 
75) 


The phenomenon called kakari musubi will be discussed in more detail below. As seen 
above, neutral polar questions take no focus marking. 


(125) Yuwan 

a. kuri-ba tu-ju-mi? 
this-Acc take-1PFv-Q 
"Will (you) take this?’ 

b. nuu-ba-ga tu-jur-u? 
what-Acc-roc take-IPFV-Q 
"What will (you) take?' 

c. kuri-ba-du  tu-jur-ui? 
this-Acc-roc take-IPFv-Q 


"Will (you) take this?’ (Niinaga 2010: 76f.) 
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From a diachronic perspective the content and focus question markers perhaps con- 
tain the same element -u. The element -i possibly has a cognate in Shuri -i(i) (or perhaps 
=ji). Clearly, Yuwan -mi is cognate with Shuri and Tsuken -mi. It has been proposed 
that these also contain an actual question marker -i ~ -i. The focus marker =du may also 
appear in declarative sentences while =ga is restricted to content questions (Niinaga 
2010: 75). The three question markers exhibit an interesting interaction with polarity 
and tense (Table 5.41). In the past tense the question markers attach to the “declarative” 
(past) marker -tar, the loss of the r before consonants is regular. In non-past tense, on 
the other hand, the question markers replace the declarative -i (perhaps cognate with 
Shuri -i ‘PRs.PTCP’). 


Table 5.41: Declarative and question markers in Yuwan (Niinaga 2010: 64) 


Category Assertion Negation 
NPST DECL -i -an 
PQ -mi -ami 
FQ -ui = 
CQ -u = 
PST DECL -tar -an-tar 
PQ -ta-mi -an-ta-mi 
FQ -tar-ui -an-tar-ui 
CQ -tar-u -an-tar-u 


In the non-past the polar question marker has a special negative form -ami as opposed 
to the plain negative -an. Negative forms of -ui and -u apparently only exist in the past 
tense. 

Another question marker -ga(i) is always used in combination with the suppositional 
enclitic =daroo. The following sentence was translated as a tag question by Niinaga. 


(126) Yuwan 
an ?cjoo sjensjee-ja | ar-an-daroo-ga(i)? 
that person.roP teacher- TOP COP-NEG.NPST=SUPP=Q 


*(I) suppose that that person is not a teacher, is that right?’ (Niinaga 2010: 73) 


Niinaga (2010: 72) also used the gloss 'confirmative question’ for -ga(i). Another en- 
clitic =jaa “is used only with intentional inflection to confirm the hearer acknowledges 
the intention of the speaker” (Niinaga 2010: 72). 


(127) Yuwan 
wan-ga ik-joo=jaa? 
1sG=NOM go-INT=Q 
‘I will go, right?’ (Niinaga 2015: 329) 
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Van der Lubbe & Tokunaga (2015) give an overview of two dialects spoken on Oki- 
noerabu among the Amami islands, Masana in the west and Kunigami in the east. But 
most examples for questions are from Masana. Masana has the same enclitic =na ~ =nja 
for polar questions as several languages mentioned above, but in content questions the 
same form =joo as in Ura is found. 


(128) Okinoerabu (Masana) 
a. fatia-n=nja? 
know-IND=Q 
‘Do you know?’ 
b. ?uda=gatfi=joo? 
where=DIR=Q 
"Where are you going?’ (van der Lubbe & Tokunaga 2015: 353) 


The dubitative suffix -ra usually combines with the focus marker =ga and was translated 
as ‘I wonder if’ but is not a question marker in the strict sense. Another dubitative marker 
-ro on the other hand is “used to ask questions in a less direct way” (van der Lubbe & 
Tokunaga 2015: 357). 


(129) Okinoerabu (Masana) 
kiba-ti mee-ro? 
work.hard-MED exist.HON-DUB 


‘Are you working hard?’ (a greeting) (van der Lubbe & Tokunaga 2015: 357) 


Exactly the same description was given for three other markers. The enclitic =kaja 
could be related to Shuri =gajaa. Both can be found in content questions, e.g. taru=kaja? 
"Who would that be?’ (van der Lubbe & Tokunaga 2015: 362). The origin of the other two 
(PQ =sa, CQ =do) remains unclear for now. According to van der Lubbe & Tokunaga (2015: 
361) “in the past tense, the medial converb is used rather than the past tense suffix -ta-" 
In Kunigami and other varieties in the eastern part, the verbal suffix -jee is employed 
instead of the enclitic =na ~ =nja. This might be a cognate of Yuwan =jaa and Ogami es 
that we will soon encounter, e.g. kuruma ?a-jee? ‘car cop-Q' ‘Is there a car?’ (van der 
Lubbe & Tokunaga 2015: 362). 

Shuri (or Okinawan) as spoken on Okinawa has several question markers and displays 
strong similarities to other languages mentioned thus far. There is a particle naa that 
has a short vowel in Ura, Yuwan, Tsuken, Tarama, and Ikema and in these languages 
has sometimes been analyzed as enclitic, sometimes as freestanding particle. It has been 
translated as a tag question by Miyara (2015), but may also be a plain polar question 
marker. 


(130) Shuri 
kamadee=ga mango tfuku-ta-n=naa? 
PN=NOM mango grow-PST-IND-Q 


‘Kmadee grew mangoes, didn’t he?’ (Miyara 2015: 394) 
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But Shuri also has an interrogative verb morphology. In some cases it is not entirely 
certain that we are not dealing with enclitics instead, but for purposes of comparison 
all forms have been given as suffixes. Similar to Yuwan, there is a polar question suffix 
-mi, but content questions take the suffix -ga. According to Uemura (2003: 95), as well 
as (Arakaki 2003: 181f.), however, the actual question marker for polar questions is -i 
and the -m originally was an affirmative, declarative, or indicative marker that has the 
form -n in other contexts. According to Arakaki (2010), the suffix -n is an evidential 
marker for “direct evidence”. As opposed to Yuwan, which uses the plain negative -an 
and the interrogative negative -ami, Shuri retains its original form in the negative, i.e. 
-(r)an-i. While in Yuwan the new polar question marker simply attaches to the past tense 
form (-ta-mi), Shuri has an amalgamated form -ti(i) that in all likelihood goes back to 
a combination of the past tense marker -ta and the interrogative -i. However, Uemura 
(2003: 145) and Arakaki (2015: 67) seem to suggest a combination of the past participle 
and the question marker instead. In content questions, -ga takes the last position, is fully 
analyzable, and always replaces the indicative ending wu Table 5.42 gives an overview 
of Shuri verb forms with a focus on interrogative verb morphology. 


Table 5.42: Shuri verb forms illustrated with the verbs 'uki- ‘to wake up’ and 
kac- ‘to write’ according to Arakaki (2003: 180f., passim); partly reanalyzed (cf. 
Uemura 2003) 


Vowel stem Consonant stem 
PRS.PTCP ‘uki-i kac-i 
NPST-IND ‘uki-ju-N kac-u-N 
NEG (NPST) ‘uki-ran kac-aN 
NPST.NEG-Q (PQ) ‘uki-ran-i kac-an-i 
NPST-IND.Q (PQ) ‘uki-ju-mi kac-u-mi 
NPST-Q (CQ) ‘uki-ju-ga kac-u-ga 
NPST.NEG-Q (CQ) ‘uki-ran-ga kac-an-ga 
PST.PTCP ‘uki-ti kac-i 
PST-IND ‘uki-ta-N kac-a-N 
NEG-PST-IND ‘uki-ran-ta-N kac-aN-ta-N 
NEG-PST.Q (PQ) ‘uki-ran-ti kac-an-ti 
PST.Q (PQ) ‘uki-ti(i) kac-i(i) 
PST-Q (CQ) ‘uki-ta-ga kac-a-ga 
NEG-PST-Q (PQ) "uki-raN-ta-ga kac-aN-ta-ga 


Uemura (2003: 95) furthermore mentions the partly suppletive copula forms ja-n (af- 
firmative), ja-mi (polar question), and 7a-ran-i (negative polar question). Consider some 
examples with interrogative verb morphology. 
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(131) Shuri 
a. kamadee-ga mangoo tfuku-ju-mi? 
PN-NOM mango grow-PRS-Q 
"Will Kamadee grow mangoes?' 
b. taa-ga | maggoo tfuku-ta-ga? 
who-NoM mango grow-PST-Q 


"Who grew mangoes?' (Miyara 2015: 393, 394) 


Whether the suffix -i(i) seen above has to be differentiated from the particle =ji found in 
focus questions, remains unclear. 


(132) Shuri 
kamadee-ga-du mango tfuku-ju-ru-ji? 
PN-NOM-FOC mango grow-PRS-NIND-Q 


'Is it Kamadee who grows mangoes?' (Miyara 2015: 394) 


According to the description by Arakaki (2003: 181f.), the question marker -i(i) attaches 
directly to the verb stem and replaces the usual past tense ending -a-n. 


(133) Shuri 


a. waN-nee tigami kac-a-N? 
1sG=TOP letter write-PST-IND 
‘I wrote a letter? 

b. fjaa-ja tigami kac-ii? 
2sc-TOP letter write-Q 


"Did you write a letter?' (Arakaki 2003: 181) 


However, if -ti(i) indeed stems from -ta + -i (or -ti + -i), perhaps -i(i) can be analyzed as 
-a + -i (or -i + -i). The occasional long vowel (-tii, -ii) in Arakaki's (2003) and Uemura's 
(2003) data might be a reflex of this. In (132) above, the focus marker -du requires the 
non-indicative ending -ru on the verb. The Yuwan verbal ending -ui—combined with =du 
as well—possibly contains a cognate of Shuri -ji (or perhaps -i(i)). Content questions 
in Yuwan only have the ending -v. In Shuri, if the focus marker -ga is present, again 
identical to the question marker in content questions, the verb takes the question or 
dubitative marker -ra. This pattern can be found in both content and focus questions. 
See below on kakari musubi for further information on this phenomenon. 


(134) Shuri 


a. nuu ífi-yu-ga? 
what wear-PRS-Q 


"What do you wear?’ 


178 


5.6 Japonic 


b. nuu-ga  tfi-yu-ra? 
what-roc wear-PRS-Q 


"What do you wear?' (Nagano-Madsen 2015: 204) 


c. kamadee-ga-ga mangoo tfuku-ju-ra jaa? 
PN-NOM-FOC mango grow-PRS-Q Q 


‘Is it Kamadee who will grow mangoes?’ (Miyara 2015: 394) 


Apart from all the different forms mentioned, the last example has yet another particle 
jaa, originally glossed as ‘I wonder’, that can also appear as a part of the complex form 
ga-jaa. As noted above, it may be related to the form ya in Old Japanese. The first element 
is unlikely to be the content question marker -ga because gajaa can also appear in polar 
questions. The description is insufficient to give a good summary here but (ga)jaa appears 
in focus and content questions. 


(135) Shuri 
kamadee=ja nuu | tfuku-ju-gajaa? 
PN-NOM what grow-PRs-Q 


‘Kamadee is going to grow what?’ (Miyara 2015: 395) 


As opposed to other Ryükyüan languages the marker -ka does not mark neutral ques- 
tions but rather suggestions. 


(136) Shuri 
fari-ga  ?i-i-fe=e tfik-an-ka? 
3SG=NOM say-PRS-N-TOP? listen-NEG-SGS 
‘Shall we not listen to him?’ (Miyara 2015: 395) 


Intonation in Shuri is exceptionally well described and too complex to go into every 
detail here (see Nagano-Madsen 2015). Several important points have been summarized 
as follows: 


In Japanese, both yes-no and wh-questions are accompanied by final rising pitch. 
In Okinawan, neither yes-no questions nor wh-questions are accompanied by fi- 
nal rising pitch. Like a yes-no question, Okinawan wh-question has intonation 
composed of its lexical accent type unless the verb is immediately preceded by a 
wh-word. When a verb is immediately preceded by a wh-word, the lexical accent 
of the verb is usually strongly compressed or rather deleted. [...] 


Although the most usual form of forming interrogatives in Okinawan is with a 
mood suffix, it is not impossible to make an utterance that has (declarative) indica- 
tive mood suffix +N, which is produced with a final rising pitch. Furthermore, it 
is quite common to form an interrogative with the sentence-final question particle 
na, which is also produced with a final rising pitch. (Nagano-Madsen 2015: 209) 
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Tsuken (spoken on Tsuken island close to Okinawa) has a polar question marker -mi 
that probably is related to the marker -mi in Shuri or -mi in Yuwan. At first glance, the 
question marker replaces the declarative ending in the following examples in a non-past 
tense. But in fact, -mi must goes back to *-n-i as in Shuri and Yuwan. 


(137) Tsuken 
a. fjaa-ga kak-u-n. 
2SG=NOM write-NPST-DECL 
"You will write’ 
b. Zjaa-ga kak-u-mi? 
2SG=NOM write-NPST-Q 
"Will you write?' (Matayoshi 2010b: 92) 


But there is also a cognate of the marker -na ~ =nja in Yuwan and other languages that 
enclitically attaches to the sentence. It does not replace the declarative marker but rather 
attaches to it. 


(138) Tsuken 
kuruma-kara si-sa-n-na? 
car=ABL come-PST-DECL=Q 


‘Did you come by car?’ (Matayoshi 2010b: 102) 


The distribution between the two markers also remains unclear in Tsuken but probably 
is connected to the verb ending. Content questions have a marker =ga that, as in Shuri, 
looks suspiciously similar to the focus marker =ga (Matayoshi 2010b: 102). A connection 
with the nominative/genitive =ga seems unlikely. 


(139) Tsuken 
taa-ga | sa-ga? 
who=NoM do=Q 


"Who does?’ (Matayoshi 2010b: 94) 


There is no example in which the plain focus marker =ru is found in a question, which 
does not mean, however, that this is impossible. The same is true for the focus marker 
=du in the language Tarama. 

Tarama (spoken on Tarama and Minna among the Miyako islands) otherwise has a 
straightforward pattern with =na found in polar questions and =ga in content questions. 
Again, the optional focus marker in content questions is identical in form with the ques- 
tion marker. 


(140) Tarama 


a. kure=e kam=nu sima=na? 
this=Top god=cEn island=Q 


‘Is this an island of god?’ 
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b. naa=ju=ba nuu=ti=ga li-ga? 
name=ACC=TOP what=QUOT=FOC say=Q 
‘What is your name?’ (Aoi 2015: 417) 


There are also examples where there is only one marker with the form =ga. Aoi glosses 
the form as question, but it might well be the focus marker. 


(141) Tarama 
nuu-ga si-tar? 
what-?o do-PsT 


"What happened (with you)?' (Aoi 2015: 419) 


Ikema (spoken on Ikema, Irabu, and Miyako among the Miyako islands) also has the 
two question markers =na (PQ, FQ) and =ga (CQ). But, as opposed to Yuwan, for instance, 
the focus marker =du appears not only in focus but also in content questions. 


(142) Ikema 
a. husi-nu-du  mii-rai | ui-na? 
star=NOM=FOC look-POT CONT.NPST=Q 
‘Can you see the stars?’ 
b. nau=nu=du mii-rai  ui-ga? 
what-NoM-roc look-POT CONT.NPST=Q 
"What can you see?’ (Hayashi 2010b: 173) 


The Hirara dialect of Miyako has yet another distributional pattern. According to 
Koloskova & Toshio (2008: 620), there is a distinction between three focus markers, 
namely =ga in content questions, =nu in polar questions, and =du in declaratives. In 
Ikema a special question marker for topic questions is =da, which is always combined 
with the topic marker. In Masana (Okinoerabu) the question marker =do can also be 
combined with the topic marker =wa (van der Lubbe & Tokunaga 2015: 362). 


(143) Ikema 
vva=a=da? 
2SG=TOP=Q 
‘How about you?’ (Hayashi 2010b: 173, fn. 16) 


Questions in Ogami (spoken on Ogami next to Miyako and in one village on Miyako 
itself) have a pitch that “is high and level and falls sharply on the last syllable” (Pellard 
2010: 146). Similar patterns may exist for other Ryükyüan languages but usually were not 
stated as clearly. There are two optional question markers, a by now familiar particle -ka 
and a suffix -ee that “is limited to past tense forms, the copula and stative verbs" (Pellard 
2010: 151). It may be worth noting that it is identical to a suffix that derives agent nouns 
(Pellard 2009: 118) and we might be dealing with a development parallel to Japanese no.” 


For the following examples only Pellard (2009) in French was quoted, but they can usually also be found 
in Pellard (2010) in English. 
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(144) Ogami 
nauripa-tu kuu-tatar-ee? 
why=FOC  come-PST.NEG-Q 


‘Why didn’t you come?’ (Pellard 2009: 211) 


I was unable to find a good example for the sentence-final particle =ka in Pellard (2009; 
2010). The only example is an embedded content question. 


(145) Ogami 
[nau-iu as-sipa=tu  tau-katw=ka] ss-ai-n? 
what=acc do-circ=Foc good-v-Q know-PoT-NEG 
‘I don't know [what I should do]: (Pellard 2009: 225) 


There is a special marker mukara for embedded polar questions comparable to English 
if /whether or German ob (Pellard 2009: 221). The focus marker -tu is sometimes found 
attached to a verb as well and we might be dealing with a development of a question 
marker as in Irabu, but Pellard (2009: 192) is not very clear about this. 


(146) Ogami 
vva-a  pssnii-pa asi-tu? 
2SG=TOP siesta=TOP.OBJ do=?Q 


‘Have you taken a siesta?’ (Pellard 2009: 221) 


Questions in the language Irabu (spoken on Irabu among the Miyako islands) exhibit 
an interesting interaction with focus marking. According to Shimoji (2011a: 118), 


when a focus marker is present, a question marker is optional, and its form is iden- 
tical to that of the focus clitic in the same clause. I treat these two (i.e., the focus 
marker and question marker) as different morphemes owing to the fact that they 
show different allomorphic patterns, even though the focus marker may be the 
historical source of the question marker. 


If only a question marker is present, it attaches sentence-finally to the verb. This is a 
plain polar question. 


(147) Irabu 
vva-a  uri-u az-tar-ru? 
2sG- TOP that-Acc say-PST=Q 


"Did you say that?' (Shimoji 2011a: 119) 


In the following two examples both focus and question markers appear. The first example 
is a focus question, the second a content question. 
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(148) Irabu 


a. vva-ga-ru uri-u az-tar-ru? 
2SG=NOM=FOC that-Acc say-PST=Q 
‘Did you say that?’ 

b. vva=a  nau-ju-ga az-tar=ga? 
2sc-TOP what=ACC=FOC say-PST=Q 
"What did you say?’ (Shimoji 2011a: 118) 


The two markers -ru and -ga are probably cognate with Yuwan -du and -ga, where 
they express only focus. The fact that the question markers are optional if the focus 
marker is present, might be a hint of the historical development. Presumably, the focus 
marker =ru was reanalyzed as a question marker in focus questions and subsequently 
also marked polar questions. From there it may have spread back to focus questions in 
its new position attached to verbs. But in the absence of any historical data, this scenario 
must remain speculative. Shimoji (2011a) has one example of an embedded alternative 
question that shows double marking and no disjunction. 


(149) Irabu 
[ssibara=ru a-tar-xru | maibara-ru a-tar=ru] mmja s-sa-n-O=suga 
back-roc cor-Psr-ofront-roc COP-PST=QINTJ know-THM-NEG-NPST=but 
‘But I’m not sure [whether (the house) was behind or in front]? (Shimoji 2011a: 
132£) 


The presence of the focus marker in Irabu excludes realis marking on the verb (see be- 
low on kakari musubi). Lawrence (2012: 396) briefly mentions question marking in the 
Nakachi variety of Miyako, which shows a somehow reminiscent pattern. In content 
questions there is only the marker -ga on the interrogative itself while polar questions 
have the focus marker -ru, exclusively. In polar questions the slightly different -ro is 
found sentence-finally. 

Hateruma is the name of one of the Yaeyama islands but as usual is also used to 
refer to the language spoken there Aso (2010a; 2015). Due, however, to relatively recent 
population movements, the language is also spoken on another Yaeyama island, namely 
Ishigaki. Hateruma has four inferential suffixes =kaja, =sa, =dore, and =paci, the first 
three of which may correspond to the forms found in Okinoerabu above, i.e. =kaja, =sa, 
-do (Matayoshi 2010a: 208). But polar questions are also expressed with the enclitic =naa 
while content questions remain unmarked. This is a rather untypical pattern for a Japonic 
language but is the norm in most other languages in Northeast Asia (see Chapter 6). 


(150) Hateruma 


a. da=0 sinsin-naa? 
2sG=(CORE) teacher=Q 


"Are you a teacher?’ 
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b. kuri=@O=ja nu ja-O? 
this=(CORE)=TOP what COP-NPST 
‘What is this?’ (Matayoshi 2010a: 210) 


Often an enclitic such as =ba is found in content questions, but this has instead an em- 
phatic or focus function. 

Hatoma is another Yaeyama variety. While Hateruma is a small island south of the 
main island Iriomote, Hatoma is an even smaller island on the north of it (Matayoshi 
2010a: 189). Hatoma exhibits an interesting split between past and non-past content ques- 
tions (Lawrence 2012: 396), the former, like polar questions, being marked by (probably 
rising) intonation alone and the latter showing a second split. Non-past content ques- 
tions usually have an attributive form of a verb followed by the marker -wa. But if an 
interrogative phrase stands sentence-finally, it takes the marker -ja, instead. Apparently, 
the difference lies in the clause type with either a verbal or a non-verbal predicate. Con- 
tent questions thus have three different markings. 


(151) Hatoma 

a. nunti kanan=wa? 
why  write.NEG-Q 
"Why won't wou write?' 

b. kjuu=ja  nuw-nu fpii-ja? 
today=Top what-GEN day=Q 
‘What day is it today?’ 

c. nuusi nat-taa? 
how become-pst 
"What happened?' 

d. waa aca-N k-ii ffir-un? 
2sG tomorrow-also come-INF give.me-AFF 


"Will you come tomorrow, too?’ (Lawrence 2012: 396) 


Descriptions of Ryükyüan languages almost never give information on other question 
types such as alternative questions, Lawrence (2012: 397) being an exception. Hatoma 
alternative questions either display simple juxtaposition or double marking with the 
form -kajaa. 

(152) Hatoma 


a. kuree turu kaburee? 
us rop bird bat 


‘Is this a bird or a bat?’ 


b. kuree  turu-kajaa kaburee-kajaa? 
this.roP bird=Q bat-Q 


‘(I wonder) is this a bird or a bat?’ (Lawrence 2012: 397) 
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Cognates of the marker =kajaa were already encountered in Okinoerabu and Shuri. In 
Hatoma it can also be found in (less direct) content questions such as taa=kajaa? ‘(I 
wonder) who (is it)?’ (Lawrence 2012: 396). 

The last Yaeyama variety to be considered here is called Miyara or Miyaran, spoken 
on Ishigaki island (Izuyama 2003; Davis & Lau 2015). In Miyara both polar and focus 
questions may be expressed with the help of rising intonation alone. In focus questions 
an additional focus marker =du appears and triggers the loss of the indicative ending on 
the verb. 


(153) Miyara 
a. naoja=ja sinbun jum-u-n? 
PN-TOP newspaper read-PRS-IND 
"Will/does Naoya read the newspaper?' 
b. naoja-ja sinbun=du jum-u? 
PN-TOP newspaper=FOC read-PRS 
"Will/does Naoya read the newspaper?’ 
c. naoja-ja noo-du  jum-u? 
PN-TOP what-roc read-Pns 


"What does/will Naoya read?' (Davis & Lau 2015: 260) 


Content questions have the same (optional) focus marker but exhibit falling intonation. 
Notice the absence of the final -N from content questions even if the focus marker =du 
is not present. 


(154) Miyara 
zima-ge  har-u? 
where-DIR go-PRS 


"Where are you going?' (Davis & Lau 2015: 261) 


Miyara also has the dubitative particle kajaa as well as a particle i that "indicates a 
request for agreement" (Izuyama 2003: 28f.). Details remain unclear, but the latter might 
be comparable with =ji in Shuri. 

Yonaguni is the westernmost island of the Yaeyama islands, only about 100 km off 
the coast of Taiwan. Here only two Yonaguni dialects will be addressed, Dunan and 
Sonai. In Dunan polar and focus questions are marked with a sentence-final clitic =na. 
Content questions have their own sentence-final marker -nga. There is an additional 
focus marker in focus (=du) and content questions (=ba). A non verbal content question 
has the question marker -ja instead of -nga. 


(155) Dunan 


a. khuruma mut-i bu-na? 
car hold-MED IPFV=Q 


‘Do (you) have a car?’ 
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b. suuti khat-i=du bu-ru-na? 
book write-MED-FOC IPFV-PTCP-Q 
‘(Are you) writing a book?’ 
c. nda-ja tharu-nkinu(-ba) | thura-nga? 
2SG=TOP PN-DIR  what(-roc) give=Q 
"What will you give to Taro?' 
d. nma=na(=ba) dunan-ccima-ja? 
where-NoM(-roc) PN-island=Q 
"Where exactly is Yonaguni island?' (Yamada et al. 2015: 468, 466, 469) 


Whether =ja might be connected to Okinoerabu and Ura =joo remains unclear to me. 

In Sonai the situation is very similar to Dunan (Izuyama 2012: 442ff.). The polar ques- 
tion marker has the form -na(i) and content questions have two different markers with 
the same distribution, =ga in verbal and =ja(a) in non-verbal clauses. In addition, there is 
a dubitative form =kaja(a) roughly meaning ‘I wonder’ as in Hatoma and other varieties. 
The two elements -du and -ba obviously correspond to Dunan -du and -ba. Izuyama 
(2012: 443) calls them focus and selective particles but writes them attached to the pre- 
ceding word with the help of a hyphen. The question markers on the other hand were 
written detached from the preceding word. I reanalyze all of them as enclitics. 


(156) Sonai 
a. num-i-na? 
drink-IND=Q 
‘Have you drunk it?’ 
b. Nma-nki_ h-jun=ga? 
where-ALL g0-PFV=Q 


"Where has (she) gone?’ 


c. u-ja nu-ja? 
this-roP what=Q 
"What is this?’ 
d. nu-ba nda munu-ja? 


what-sEL 2sG.GEN thing=Q 
"Which is yours?' 

e. Nda ici-ba ` s-uN-ga? 
2sc when=SEL come-PFV=Q 


‘When did you come?’ 


f. ta-ba-du tata-N-ga? 
who=sEL=Foc make.stand-coNc=Q 


‘Whom do you make stand?’ 
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g. nu=ba=du ut-iru=kaja? 
what=sEL=Foc fall-conc=Q 


‘I wonder which one will fall down?’ (Izuyama 2012: 439, 419, 425, 444, 421) 


As in Ikema, the focus marker =du is also found in content questions and is not restricted 
to focus questions as in Yuwan. 

Table 5.43 summarizes the marking of questions in Japonic languages. Given the lack 
of information on alternative questions, these have been excluded from the summary. In 
general, it appears that alternative questions show the double marked type and lack a 
disjunction. Forms with an additional semantic component such as those translated with 
‘I wonder’ are excluded from the list as well. 

Most languages have different markers for polar and content questions. Ogami and 
Japanese are exceptional in allowing the same marker. Apart from Hateruma and Yilan 
Creole all languages have content question markers. Little information is available on 
focus questions. In some languages such as Dunan, Ikema, and Japanese they have the 
same marking as polar questions, plus an additional focus marker. In Yuwan and Shuri 
there are special question markers, but Shuri also allows the question and focus markers 
from content questions to enter focus questions. The only languages without at least an 
optional polar question marker are Hatoma and Miyara. 

A typologically rare phenomenon of Japonic languages that is relevant for interroga- 
tive constructions is a kind of focus concord, usually called kakari musubi (KM) ‘govern- 
ing (and) concordance’ (cf. Shimoji 2010: 11; Shinzato & Serafim 2013). We have already 
encountered a special type in Yuwan above that is limited to interrogative constructions. 
Specifically, the focus markers =du in focus questions and =ga in content questions nec- 
essarily are followed by the verb endings -ui and -u, respectively. Usually, however, the 
phenomenon is not restricted to questions but can also be found in declarative sentences. 
More generally, kakari musubi can be characterized as “a syntactic agreement construc- 
tion in which specific particles called kakari joshi (kakari particles, KP henceforth) cor- 
relate with particular predicate conjugational endings other than regular finite forms to 
end a sentence.’ (Shinzato 2015: 299) 

KM is attested in some Ryükyüan languages as well as Old Japanese, but not in mod- 
ern Japanese. Altogether, Japonic has five different kakari particles, of which we have 
already encountered ka and ya. In Old Okinawan only three of them have clear cognates 
(Table 5.44). The first three of the markers may go back to demonstratives (cf. pre-modern 
Japanese demonstratives ko-, so-, ka-, see §5.6.3). According to Shinzato (2015), kakari 
musubi is similar to an it-cleft construction, i.e. a way of marking focus. This may be 
the reason why the kakari particles are also found in focus as well as content questions. 
The verbal ending triggered by the KP is usually an adnominal form. Modern Ryükyüan 
languages nevertheless show several deviations from this rule. In Miyara and Irabu, for 
instance, there is no adnominal form of the verb (Davis & Lau 2015: 257). In Miyara the 
presence of the focus marker leads to the loss of the indicative ending (see also example 
153 above). 
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Table 5.43: The marking of polar, focus, and content questions in Japonic; 
whether or not a focus marker is optional is not indicated 


PQ CQ FQ 
East OJ ya, ka ka ATTR + ya, ?ka 
West OJ ya, ka ka ATTR + ya, ?ka 
Hachij6 ?kai ?- ? 
Japanese ka#, nost ka# ka# + wa TOP 
Yilan Creole gas, no# - ? 
Ura =na# ~ =nja# =joo# ? 
Yuwan V-mi, =na# V-u + =ga FOC V-ui + =du FOC 
(Table 5.41) 
Okinoerabu =na#, IND -n, -Ø + =joo# ? 
=nja#/-jee# + PST -ti 
instead of -ta 
Shuri V-mi, V-i, V-ti, V-ga, V-ra + =ga =ji# + =du Foc, V-ra + =ga FOC 
=naa# (Table 5.42) FOC 
Tsuken V-mi, =na# -gaf 
Tarama -nas =ga# + =ga FOC 1 
Ikema =na# =ga# + =du FOC =na# + =du FOC 
Ogami =ka#, ?=tu, PST, cop, =ka#, PST, COP, ? 
STAT.V -££ STAT.V -££ 
+ =du Foc 
Irabu =ru# + =ru FOC =ga# + =ga FOC ? 
Hateruma =naa# e ? 
Hatoma - PST -, ATTR + -Wa, ? 
=ja# (non-verbal) 
Miyara - lack of IND -N + =du lack of -N + =du Foc 
FOC 
Dunan =na# =nga#, =ja# =na# + =du FOC 
(non-verbal) 
+ =ba Foc 
Sonai =na(i)# =ga#, =ja(a)# ? 


(non-verbal) 
+ =ba SEL, + =du FOC 


Table 5.44: KPs in Proto-Japonic, Old Japanese, and Old Ryükyüan according 
to Shinzato (2015: 306ff.) 


Proto-Japonic Old Japanese Old Okinawan 
GroupI *k6(swo) koso su 

"tyó so do 

"ka ka ga 
Group II ? ya ?yi 

? namu S 
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(157) Miyara 
a. naoja-ja sinbun jum-u-n? 
PN-TOP newspaper read-PRS-IND 
‘Naoya will read/reads the newspaper’ 
b. naoja-ja sinbun=du jum-u? 
PN-TOP newspaper=FOC read-PRS 
‘Naoya will read/reads the newspaper? (Davis & Lau 2015: 260) 


The phenomenon found in Irabu has been called quasi-kakari musubi (Shimoji 2011b). 
Instead of the obligatory presence of a certain verb ending (usually adnominal), Irabu not 
only excludes the presence of realis marking but allows other types of endings (including 
irrealis, mood-neutral etc.). 


(158) Irabu 


a. ba-a kuruma-u-du vv-tar. 
1sG-TOP car-ACC-roc  sell-psT 


‘I sold a car? 


b. *ba-a  kuruma-u-du vv-tam. 
1sG=TOP car-ACC-roc  sell-psT 


‘I sold a car? (Shimoji 2011b: 120) 


Shimoji (2011b: 121) calls these two different types positive and negative concordance. 
For a phenomenon similar to kakari musubi in NEA see §5.14.2 on question marking in 
Yukaghiric. 


5.6.3 Interrogatives in Japonic 


Interrogatives in Japonic languages are not very well described. Most descriptions avail- 
able to me simply mention one or two forms but do not dwell on their analysis, etymol- 
ogy, or usage. The major exception in the Western literature is Vovin (2005: 297-336). 
Some interrogatives such as ‘who’, ‘what’, and ‘when’ are probably of Proto-Japonic ori- 
gin (Table 5.45). 

These forms represent three major groups of interrogative present in Japonic lan- 
guages that start with *t-, *n-, and *e-, respectively. Japonic has neither KIN- nor K-inter- 
rogatives. The Proto-Japonic interrogative *ta- ‘who’ is basically present in all Japonic 
languages. Written pre-modern Japanese still had ta-re instead of modern day da-re (As- 
ton 1904: 63). Yilan Creole has an initial liquid instead (/a-re). In some languages the base 
stem is used as interrogative while other languages exhibit different suffixes. The suffix 
-re in Japanese and its equivalents in some of the other languages is probably related to 
the suffix found in do-re ‘which’ as well as the demonstratives (see below). Its meaning 
is somewhat unclear but it may be treated as a stem extension. 


16This form is rare and probably originates in the Western dialect. 
U'Hachijo data taken from http://www008.upp.so-net.ne.jp/ohwaki/hougen.htm. (Accessed 2016-01-19.) 
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Table 5.45: Japonic interrogatives for ‘who’, ‘what’, and ‘when’; many South- 
ern Ryükyüan forms stem from Bentley (2008a: 298-299); EOJ = Eastern Old 
Japanese, WOJ = Western Old Japanese, PMJ = pre-modern Japanese, OR = 
Old Ryükyüan (Vovin 2005; Kupchik 2011); the N stands for prenasalization; 
transcription of Shodon glottal stop modified 


who what when 
EOJ ta-re £l, SL, ... ?aN- Z, ... < *ani (nani ZS A8 — itu, 
ta-(N)ka 2B, In. .…. 
Hachijo!” da-re Zh, ani HI ? 
da-ga 72% 
WOJ ta-(re) Sis, £t, ... nani ZER. ABE, ... itu ap 
ta-Nka $B, fib £i, ... 
PMJ ta-re, (da-re), ta-ga nani itsu 
Japanese da-re &£, 724v nani (~ nan) AJ, lc itsu 4 BF, OD 
Yilan Creole  la-re nani ? 
Ura ta-ru nan icu 
Yuwan ta-ri - ta-ru nuu ici 
Okinoerabu ` ta-ru ~ ta-1) nuu Titfi 
Shodon tha-r(u-), thaa-ga nu(u) ~ nu(u) Zeit?) 
OR ta(a) 7-45, ta-ru KS, nau RE itu ?[itsi] V2 
ta-ga in 
Shuri taa nuu Titfi 
Tsuken taa ? ? 
Hirara ta-ru ~ too noo itsi 
Tarama taa-ga, tau nuu itsi 
Ikema ta-ru nau ? 
Nagahama ta-ru nau itsi 
Ogami ta-ru ~ tau nau iks 
Irabu ta-ru nau ic 
Ishigaki ta-ru - taa noo itsi 
Kohama ta-ru nuu itsu 
Kuroshima ta-ri ~ taa nuu itfiya 
Hateruma ta-ru - ta(a) nu(u) ici 
Hatoma taa nuu itsi 
Miyara ta-ru noo itsi 
Dunan thá nt ? 
Sonai ta(a), ta-a, takka nu(u) ici 


The suffixes -Nka in Old Japanese and -ga in Old Ryükyüan are said to have a pos- 
sessive function (Vovin 2005: 298ff.). Hachijo -ga, Shodon -ga, Tarama -ga, Okinoerabu 
-ņ, and Sonai -ņa are likely of the same origin. It may be worth noting, however, that 
in these languages the suffix combines the function of both the genitive as well as the 
nominative (Izuyama 2012: 417; van der Lubbe & Tokunaga 2015: 352; Aoi 2015: 415). 
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The Proto-Ryükyüan interrogative meaning ‘what’ probably had the form * nau. Forms 
such as Miyara noo have gone through regular sound changes, in this case "ou > oo 
(cf. Davis & Lau 2015: 258). But the connection with Japanese nani or Hachijo ani is 
not completely straightforward. At least one Ryükyüan language has a form closer to 
Japanese (Ura nan), but this may be due to contact with Japanese. To my knowledge, 
the best, albeit problematic, explanation has been put forward by Vovin (2005: 305-313). 
He reconstructs a Proto-Japonic form *nanu, in which the *n- is said to be a prefix with 
unclear meaning. Eastern Old Japanese, Vovin claims, has a form without the prefix, as 
can be seen from a comparison of WOJ naNtó, naNsó and EOJ aNtó, aNse. According to 
Vovin, the final -i might derive from a suffix -(C)i, the meaning of which was not given. 
He assumes an irregular sound change in Ryükyüan , namely the loss of the intervocalic 
n, resulting in "nau. Vovin (2005: 313) also notices a similarity of his reconstruction with 
Austronesian *n-anu with an unclear prefix. Blust (2013: 310) reconstructs the Proto- 
Austronesian form as "anu ‘what’, and we will encounter the Atayal form nanu’ ‘what’ 
at the end of this section. The similarity is indeed striking, but depends on whether Proto- 
Japonic *nanu is a correct reconstruction or not. 

However, Vovin's explanation does not seem very plausible. For example, instead of 
postulating an otherwise unknown prefix n-, it is much more likely that Eastern Old 
Japanese simply lost the initial nasal that is present in Ryükyüan as well. Let us first 
consider the Japanese forms naze and nado meaning ‘why’. According to Vovin (2005: 
333) they have the form naNsó and naNtó in (Western) Old Japanese and are combina- 
tions of nani with the two defective verbs tö ‘to say’ and sö ~ se ‘to do’ (or a particle 
sö). Given the strong connection of the categories of REASON and ACTION, this seems 
plausible. Vovin (2005: 336) claims that Ryükyüan has no cognates of the two forms, and 
indeed, of the references used in 85.6.2 only Shimoji (2011a: 106) mentions the two forms 
nausi ‘how’ and nautti ‘why’ for Irabu. Bentley (20082: 268, 298f.) gives some additional 
forms (e.g., Hirara noofii ‘how’, nooti ‘why’) and reconstructs Southern Ryükyüan (Sak- 
ishima) *naWo-se ‘how’ and *naWo-nVte- ‘why’. The W stands for a somewhat unclear 
semi-vowel *j or * w (Bentley 2008a: 218f.). Apart from certain innovations and additional 
suffixes found in some languages, there certainly are cognates of the Japanese interroga- 
tives. Ryükyüan forms such as Irabu nau-si suggest a derivation that is directly based on 
nau ‘what’ and the same may be true for the Old Japanese equivalents, i.e. they might 
be derived from *nanu instead of nani. The nasal found in some forms such as Yonaguni 
nundi, according to Bentley, was part of the suffix instead of the stem (also cf. Shuri 
nuuntf i ‘why’, Miyara 2015: 387). 

The interrogative ‘when’ can be reconstructed as "etu (Vovin 2005: 330, see Pellard 
2008: 143, passim for details on vowels). The interrogative can be found in all Japonic 
languages for which sufficient material is available. The analysis of PJ “etu is an open 
question but it can be classified with several other interrogatives with the resonance *e- 
» i- (Table 5.46). WOJ in addition has the forms iNtu-ti ^where' as well as iku-Nta 'how 
many/much' and EOJ iNtu-si ‘which’. 

Several scholars have compared the interrogatives in *e- with Koreanic "e- (e.g., Frelles- 
vig & Whitman 2004: 289; Vovin 2005: 319, 322; 85.7.3). However, a comparison based 
on one vowel must be treated with caution. 
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Table 5.46: Interrogatives in Proto-Japonic, Western Old Japanese (WOJ), East- 
ern Old Japanese (EOJ), pre-modern Japanese (PMJ), Japanese (J), and Proto- 
Ryükyüan (PR) starting with i~ < “e~ according to Aston (1904: 63ff.), Vovin 
(2005: 297ff.), and Kupchik (2011: 589ff.); partly modified transcription; the N 
stands for prenasalization 


PJ WOJ EOJ PMJ J PR 
which *entu-re iNtu-re itu-re idzu-re do-re *edu-re 
when *etu itu itu itsu itsu *etu 
how *eka ika ika ika ika *eka 
how *eku iku- ? iku- iku *eku 


many 


The Old Japanese interrogative ika ‘how’ is not very common, is usually limited to 
Western Old Japanese and is followed by one of the defective copulas n- and tó- or the 
still more productive nar-, which is a contraction of n-i ar- 'CoP-INF exist-' (Vovin 2005: 
313-319; Kupchik 2011: 593£). Among the cognates in Ryükyüan languages we find Old 
Ryükyüan forms such as ika VV, ikya V ~ ka 7^, kya 9 etc. and Shuri ‘icaa ~ caa 
(see Vovin 2005: 318 for a more exhaustive list). In both cases there are forms with and 
without the initial vowel that is responsible for the palatalization of the following velar 
consonant. Vovin's (2005: 317) problematic and somehow unclear conclusion is that the 
interrogative has to be analyzed as “e-ka. But this is no explanation for why the initial 
element—which must be considered the interrogative as such—can simply be omitted. 
It is more reasonable to assume that ika was considered an inseparable interrogative 
by the speakers, which is why the, maybe irregular, loss of the vowel did not affect 
its interrogative status as such. The same criticism also applies to his explanation of the 
other interrogatives that will be addressed in the following. Japanese ikaga ‘how’ derives 
from the Old Japanese fixed expression ika n-i ka ‘how cop-1nF Oo (Vovin 2005: 314, fn. 
120). Vovin (2005: 319) compares the hypothetical element -ka with Korean but leaves 
open any further detail. 

Apart from the locative endings, the Old Japanese interrogative iNtu-ku ‘where’ has 
a direct cognate in Old Ryükyüan idu-ma > zuma ZZ as well as in modern Ryükyüan 
languages such as Miyara zima (Vovin 2005: 321; Davis & Lau 2015: 261). The second part 
-ma is claimed to be a noun meaning ‘place’, but in this case the interrogative idu- would 
be expected to have the meaning ‘what’ or ‘which’ rather than ‘where’. In fact, from a 
typological perspective PJ “entu (together with the extended form "entu-re) likely was 
a selective interrogative ‘which’ at first and only later developed into a locative inter- 
rogative ‘where’, as it was combined with a locative marker or a noun meaning ‘place’ 
(-ma). Several languages of the region have parallel developments and this scenario is 
corroborated by data from some Ryükyüan languages such as Irabu nzi ‘which’ versus 
nza ‘where’ that may go back to the plain and derived forms, respectively. Ogami still has 
the non-palatalized forms nti (~ iti) versus nta (~ ita) that make this development seem 
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more plausible. However, Ryükyüan languages show much stronger variation in forms 
meaning ‘where’ than in those interrogatives previously encountered. Some of Vovin's 
(2005: 321, especially fn. 123) otherwise good explanations for those deviations are some- 
what speculative and cannot be taken at face value. Among the dialects mentioned in 
85.6.2, for example, we find the forms listed in Table 5.47. A possible explanation for 
Shuri maa is the loss of the first part of idu-ma. All other forms can, following Vovin, be 
derived directly from idu-ma or rather its predecessor PR *eNtuma (Vovin 2005: 321, fn 
123). But this is certainly not true for Yuwan daa, in which the first part was deleted as 
well (cf. Okinoerabu ?uda). 


Table 5.47: Interrogative forms meaning ^where' in Japonic 


Language Form 

Eastern Old Japanese iNtu- FE 
Western Old Japanese iNtu-ku FEA 
(written) pre-modern Japanese idzu-ko 
Hachijo do-ko 
Japanese do-ko {aj 
Yilan Creole do-ko 

Ura ?uda 

Yuwan daa 
Okinoerabu ?uda 

OR idu-ma > zuma TH 
Shuri maa 

Tsuken maa 

Ogami nta ~ ita 

Irabu nza 

Hateruma za 

Miyara zima 

Dunan nma 

Sonai Nma 


Vovin mentions a Northern Ryükyüan form raa, not encountered thus far, that is prob- 
ably a variant of daa. The distinction between LOCATION, DIRECTION, and SOURCE has not 
been given for the majority of languages. Most likely, the difference in most languages 
is indicated with case markers as in (Eastern) Old Japanese (iNtu-yu ‘where from’), Japa- 
nese (doko ni ‘where (to), doko e ‘where to’, doko kara ‘where from’, my knowledge), or 
Ura (?uda=ne ‘where’, ?uda-gatfi ‘where to’, van der Lubbe & Tokunaga 2015: 361). 

In modern Japanese only a few forms in i- survive (e.g., itsu, ikura), which is due to 
a replacement with forms built on the stem do-. The fact that all forms are analyzable 
shows that this is a relatively new system. In fact, the interrogative stem do- in Japanese 
is completely in line with the demonstratives (Table 5.48). These paradigms were clearly 
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at least partly present in Old Japanese (Table 5.49). But in standard Japanese the distal 
demonstrative ka- has been replaced with a- and Old Japanese still lacked the stem do-. 
Interestingly, written pre-modern Japanese still had forms based on the stems ka- and 


idzu- (Table 5.50). In Japanese the word kare started out as a demonstrative, changed its 


meaning to a male third person pronoun and also means ‘boyfriend’ today. 


Table 5.48: Parallels in demonstratives and interrogatives in Japanese (based 
on Dixon 2012: 407; Hasegawa 2015: 332); the Kansai dialect has a regular form 


a-ko instead of the irregular a.so-ko; some endings were omitted 


proximal medial distal interrogative 
pronominal ko-re so-re a-re do-re 
adnominal ko-no so-no a-no do-no 
place ko-ko so-ko a.so-ko do-ko 
thing/person (vulgar) ko-itsu so-itsu a-itsu do-itsu 
direction/person (polite) ko-chira so-chira a-chira do-chira 
type/kind ko-nna so-nna a-nna do-nna 
adverb ko-o so-0 a-a do-o 


Table 5.49: Old Japanese demonstrative and interrogative paradigms (Vovin 
2005: 272; Kupchik 2011: 583, partly modified); there are additional forms such 


as wote ‘that (over there)’ not shown here 


proximal medial distal interrogative 
pronominal k6-(re) só-(re) ka-(re) WOJ iNtu-re 
adnominal kó-nó só-nó ka-nó ? 
place kó-kó só-kó - WOJ iNtu-ku 


Table 5.50: Paradigms of written pre-modern Japanese demonstrative and in- 


terrogative paradigms (Aston 1904: 60ff.) 


proximal medial distal interrogative 
pronominal ko-(re) so-(re) ka-(re) idzu-(re) 
adnominal ko-no so-no ka-no idzu-re-no 
place ko-ko ?so-ko p idzu-ko 


This paradigmatic parallel between pre-modern idzu- and modern do- might suggest 
that it is in fact the same etymological entity in a different phonological shape. In some 


Ryükyüan languages there is a form without the initial vowel as well. For example, Oki- 
noerabu ?uduru ‘which’ and ?uda ‘where’ (van der Lubbe & Tokunaga 2015: 350) must 
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Table 5.51: Paradigms of Hachijo demonstrative and interrogative paradigms 
(Kokuritsu Kokugo Kenkyüjo 1950: 204f.); cf. dare ‘who’; several dialectal forms 
were omitted 


proximal medial distal interrogative 
pronominal ko-re so-re u-re do-re 
adnominal ko-no so-no u-no do-no 
place ko-ko so-ko u-ko do-ko 


directly correspond to diru and daa in Yuwan. The paradigms in Hachijo are very sim- 
ilar to modern Japanese, but there is a different distal stem u- that looks similar to the 
medial stem in Ryükyüan (Table 5.51). In general, the Northern Ryükyüan languages, 
especially Amami Ryükyüan languages, have a pattern very similar to Japanese. Except 
for Miyara, the Southern Ryükyüan languages do not exhibit the same similarities in 
demonstrative and interrogative paradigms. Table 5.52 to Table 5.56 show paradigms for 
those languages that were described in sufficient detail. Also, northern Ryükyüan shares 
the distal stem a- with modern Japanese, while southern Ryükyüan still has ka-, as does 
Old Japanese. What is more, the extension of the demonstrative and the interrogative are 
only found in northern Ryükyüan and are not necessarily identical in form. In Yuwan 
and Shuri, for example, the demonstratives have the extension -ri - -ri, but the demon- 
strative has -ru. In Dunan, the extension can only be found in the distal demonstrative. 
Apparently, instead of the selective interrogative, Yonaguni uses an objective inter- 


Table 5.52: Paradigms of Yuwan (Amami) demonstrative and interrogative 
paradigms (Niinaga 2010: 50f.); cf. ta-ri/ru ‘who’; see also Martin (1970: 123- 


124) 
proximal medial distal interrogative 
pronominal ku-ri u-ri a-ri di-ru 
adnominal ku-n u-n a-n di-n 
place ku-ma u-ma a-ma daa 


Table 5.53: Paradigms of Shuri (Okinawan) demonstrative and interrogative 
paradigms (Miyara 2015: 387); form in square brackets from OCLS (1999/2003) 


proximal medial distal interrogative 
pronominal ku-ri Tu-ri 7a-ri dzi-ru 
adnominal ku-nu ?u-nu 7a-nu [dzi-nu] 


place ku-ma ?u-ma 7a-ma maa 
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Table 5.54: Paradigms of Ogami (Miyako) demonstrative and interrogative 
paradigms (Pellard 2009: 123; 2010: 129), cf. ta-ru ‘who’; no forms with the 
Ogami adnominal (genitive) -nu are available; gaps are filled with forms from 
Miyako proper in square brackets (OCLS 1999/2003) 


proximal medial distal interrogative 
pronominal ku-ri u-ri ka-ri nti - iti 
adnominal [ku-nu] ? [ka-nu] [nza-nu] 
place ?ku-ma u-ma ka-ma nta - ita 


Table 5.55: Paradigms of Miyara (Yaeyama) demonstrative and interrogative 
paradigms (Izuyama 2003: 24), cf. ta-ru ‘who’; there are also the forms Nge ~ 
Nga ‘there (medial) and ziNge ~ ziNga ‘where’ (+ -ge ~ -ga) 


proximal medial distal interrogative 
pronominal ku-ri u-ri ka-ri zt 
adnominal ku-nu u-nu ka-nu ? 
place ku-ma u-ma ka-ma zi-ma 


rogative, e.g. Sonai nu ‘what’ nu-nu ‘what-apy’ (Izuyama 2012: 431). 

Less complicated than the locative forms are the quantitative interrogatives ‘how 
much’ and ‘how many’ that are based on PJ “eku. Two suffixes, -Nta (maybe a collective) 
and -ra (maybe a plural) can sometimes be found attached to the stem (Vovin 2005: 330, 
fn. 129). Whether *eku was analyzable or not remains an open question. Middle Japanese 
had another variant iku-tu ‘how many’ that is not attested in Old Japanese. Ryükyüan 
languages have cognates of Old Japanese "eku and *ekura as well as of Middle Japanese 
ikutu. Similar to *eka the initial vowel was sometimes lost and in some cases led to the 
palatalization of the following velar, e.g. Benoki kassaa (Vovin 2005: 332), but Yuwan ik- 
jassa (Niinaga 2010: 51) < iku-ra ‘how much’. In some languages the interrogative “eku is 
preserved and is usually combined with a classifier, e.g. Okinoerabu Ziku-tfi ‘how many 
things’, 7iku-tai ‘how many people’ (van der Lubbe & Tokunaga 2015: 351). In Japanese 


Table 5.56: Paradigms of Dunan (Yonaguni) demonstrative and interrogative 
paradigms (Yamada et al. 2015: 454, 456f.) 


proximal medial distal interrogative 
pronominal khá ü kha-ri ? 
adnominal khu-nu u-nu kha-nu ? 
place khü-ma ü-ma khá-ma nmá 
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this pattern has been taken over by nan- followed by a classifier, e.g. nan-mei {14 ‘how 
many people’ (which is the source of Yilan name, Peng Qiu 2015: 53). 

Table 5.57 shows those interrogatives found in written and spoken pre-modern Jap- 
anese interrogatives. Except for those forms based on idzu, the interrogatives are still 
present in modern Japanese. There are the resonances i~ and n~. Today there is also a 
resonance in d~, but in written pre-modern Japanese, the interrogative tare ‘who’ was 
unique in that it did not exhibit any of the resonances. Japanese dare with an initial d 
might be an innovation based on dore. 


Table 5.57: Pre-modern Japanese interrogatives (Aston 1904: 63ff.); forms 
marked with an asterisk * are limited to the written language; not all deriva- 
tions are shown 


Meaning Form 

who ta-re*, da-re 

what nani 

what-PL nani-ra 
adnominal nani-no (» nanno) 
who (Japanese hito ‘man’) nani-bito 

why naze 

why nado 

which idzu-re^, do-re 
which (adnominal) idzu-re-no", do-no 
where idzu-ko*, do-ko 
whither idzu-chi*, do-chi 
whither idzu-kata* 

how, what manner ika 

how many iku-tsu 

how much iku-ra 

when itsu 


Few descriptions of Ryükyüan languages available to me give such an exhaustive list 
of interrogatives. Some questions are thus hard to answer. But the limited data allow the 
observation that, from a typological point of view, the interrogative systems are very 
different from one another. In Hateruma, for instance, all attested interrogatives except 
ici ‘when’ are only two phonemes long and none is readily analyzable synchronically 
(nu ‘what’, za ‘where’, ne ‘why, how’, ta ‘who’, Matayoshi 2010a: 199; 2015: 429). 
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Table 5.58: Ogami interrogatives (Pellard 2009: 132; 2010: 129); my tentative 
analysis based on Pellard (2009; 2010) 


Meaning Form Analysis 

who ta-ru ^ tau « PJ, loss of r, analogy to nau? 
when iks <PJ 

how many if- + CLF e.g., if-taut ‘how many people’ 
what nau <PR 

which nti ~ iti 

where nta ~ ita analogy to nti ~ iti 

why nau-ripa circumstantial converb -ripa 
how nau-pasi 

how much nau-nu-pusa, nti-ka-pusa 


In Ogami, on the other hand, the interrogatives are up to nine phonemes long and 
some are at least partly analyzable (Table 5.58). Ogami has two main resonances i- and 
n- as well as one form taru ‘who’ that does not partake in any of them. The two Ogami 
forms inquiring about quantity apparently are based on nau ‘what’ and nti ‘which’, re- 
spectively, and can be analyzed as nau-nu-pusa ~ nti-ka-pusa. The exact meaning of the 
suffixes remains unclear, however. A connection to the desiderative form -pus is unlikely 
on semantic grounds. The second part of nau-pasi also remains unclear. There is a cir- 
cumstantial converb form -ripa (Pellard 2009: 146) that might have been attached to a 
hypothetical interrogative verb nau- ‘to do what’, yielding nau-ripa ‘why’. 

In Yuwan nuusjattu probably has a similar background and may be an amalgamated 
form containing the elements nuu ‘what’, the verbalizer -s(j)ar, and the past causal con- 
verb -tattu (Niinaga 2010: 66, 71). Japanese do yatte literally means ‘doing how’ and can 
be analyzed into do ‘how’ and the so-called te-form (roughly gerund) of the verb yaru 
‘to do, to give, to put’. These few cases suffice to show a strong connection between the 
two categories of ACTIVITY and REASON (84.3). 

The Amami languages Yuwan and Shodon as well as the Okinawan language Shuri 
(Table 5.59) exhibit a pattern very similar to Japanese and have the three resonances n-, 
i~ and d~ (> dz in Shuri). But the languages preserve an initial unvoiced aspirated plosive 
t in the interrogative meaning ‘who’. 

The only polysemy that has been described can be found in Hateruma ne, which covers 
both MANNER and REASON. 

For the most part, interrogatives in Yilan Creole are identical or almost identical to 
Japanese (e.g., lare ‘who’, nani ‘what’, doko ‘where’, ikura ‘how much’, name ‘how many 
(people), Peng Qiu 2015: 52ff.). One interesting phenomenon as opposed to Standard 
Japanese (107c) is the use of an interrogative basically meaning ‘who’ instead of ‘what’ 
in questions about names, see also (140b) from Tarama (see Idiatov 2007; Holz) 2014b for 
a general discussion). This may be due to influence from Austronesian languages, maybe 
via Mandarin Chinese as spoken on Taiwan. 
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Table 5.59: Shodon (Martin 1970: 123f.), Yuwan (Niinaga 2010: 51), and Shuri 
(Miyara 2015: 387) interrogatives; accents removed, modified transcription for 
glottal stops; form in square brackets from OCLS (1999/2003) 


Meaning Shodon Yuwan Shuri 
who tha-r(u-), thaa-ga ta-ri ~ ta-ru taa 
what nu(u) nuu nuu 
why nuusjattu nuuntfi 
which one dir diru dziru 
which (adn.) din din [dzi-nu] 
where da(a) daa maa 
what kind of ?yikhyassyun 

when ?yit(i) ici ?itfi 
how many ?yitkhut(i) ~ ?yitkhu(u)t(i) 

how much ?yikhyassa ikjassa tfassa 
how tfaffi 


(159) Yilan Creole 
anta no namaega lare? 
2sG GEN name TOP who 


"What is your name?' (Peng Qiu 2015: 54) 


(160) Japanese 
anata no namae wa nan desu ka? 
2sG GEN name TOP what COP Q 


"What is your name?' (constructed in analogy to Yilan Creole) 


(161) Mandarin Chinese (Taiwan) 
ni de míngzishi shéi? 
2SG GEN name cop who 
"What is your name?’® 


(162) Maryinax Atayal 
ima? a’ ralu’=su’? 


who NOM.NRF name-2sG.BG 


"What is your name?' (Huang 1996: 271) 


This might be an areal trait that has its origin in Austronesian languages where it is a 
rather typical phenomenon (Blust 2013: 509f.). Standard Chinese as spoken in the Peo- 
ple's Republic of China usually employ the interrogative shénme ^what'. Other varieties 


8This sentence was given to me by a native speaker from Taiwan during my talk at The 8th International 
Conference on Construction Grammar (Hólzl 2014b). Chinese also has further constructions. 
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of Atayal such as Wulai in turn employ nanu’ ‘what’ instead of ima’ ‘who’ in official 
contexts, which may have its origin in Chinese (Huang 1996: 293). While in Yilan Creole 
the use of lare ‘who’ may have an origin in Austronesian, the whole construction rather 
resembles Chinese and especially Japanese, except for the lack of the copula. 


5.7 Koreanic 


5.7.1 Classification of Koreanic 


Korean has a North Korean (Pyongyang) and a South Korean (Seoul) standard. Here 
primarily the latter will be considered. In addition, Korean is officially recognized as 
a minority language in China, where it has developed its own standardized version of 
Korean based on the language spoken in Yanbian, Jilin province (L. Brown & Yeon 2015: 
466). But apart from the standard languages, Korean also contains a considerable amount 
of dialectal variation. Usually, six different dialect areas are recognized (L. Brown & Yeon 
2015: 461), but it has become increasingly clear that Yukcin has to be considered a seventh 
dialect (e.g., King 2006b: 130). 


Northwestern (Pyongan Province) 


Northeastern (Hamgyong Province) 


Yukcin (Hamgyong Province) 


(163) +} Central (Gyeonggi, Hwanghae, Gangwon, and Chungcheong Provinces) 


Southwestern (Jeolla Province) 


Southeastern (Gyeongsang Province) 


Jeju (Jeju island) 


Sohn (1999: 58) also differentiates between seven dialect zones, but instead of Yukcin 
he regards Chungcheong, included in the Central Dialect above, as a separate entity. 
Jeju clearly is the most aberrant member of the Korean dialects (e.g., Kiaer 2014). Vovin 
(2013b) even goes so far as to consider Jeju a Koreanic language in its own right. He 
claims that the primary division is between Jeju on the one side and the varieties spoken 
on the Korean Peninsula on the other. In his view, Yukcin, part ofthe Northeastern dialect 
area, is also sufficiently different from the rest of the dialects to consider it a separate 
language. But Sean (2015: 8) recently came to the rather convincing conclusion "that the 
early historical relationships among Koreanic variants are considerably non-treelike". In 
general, it may thus be better to conceptualize Koreanic as a dialect continuum with 
strong mutual contacts that make a classification into different languages problematic. 
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Within the Northeast Asian area, apart from the Korean Peninsula and adjacent re- 
gions in China, significant numbers of Korean speakers can also be found on Sakhalin, 
in Japan, and in Central Asia. The language in Central Asia, mostly in Uzbekistan and 
Kazakhstan, has its origin in Northeastern and Yukcin dialects, while the language spo- 
ken on Sakhalin is ultimately derived from the Southeast of Korea (King 2006b: 128). It 
is primarily the language spoken in Central Asia—also known as Kolyemal (Koryo lan- 
guage)—that will be included in this chapter. The Korean dialects in China are not very 
well described, but one can roughly state that “Yanbian Korean has its roots in Hamgyong 
dialect, whereas the variety of Korean spoken in Liaoning is of the Pyongan variety and 
that of Heilongj[i]ang is based on Gyeongsang” (L. Brown & Yeon 2015: 466, corrected). 
Given the scarcity of resources, only the variety spoken in Yanbian, Jilin province, will 
be included in this study (Zhao Xi 1982; Xuan Dewu et al. 1985). In Japan, apart from 
mainland Korean dialects, we also find speakers of Jeju, especially in 
isiOsaka (Saltzman 2014). 


5.7.2 Question marking in Koreanic 


When it comes to question marking, Korean has a complicated split system that depends 
on the speech level. The interrogative forms in Korean qualify as interrogative mood 
markers because they are in complementary distribution with declarative markers. In 
other words, the interrogative suffixes replace the declarative ones and are not merely 
attached to them. This is a major difference compared to most languages in Northeast 
Asia. 


(164) Korean (Jilin) 
a. narssi-ka tfoh-ta. 
weather-NOM good-DECL.PLAIN 
"Ihe weather is good: 
b. narssi-ka tfoh-ni? 
weather-NoM good-Q.PLAIN 
‘Is the weather good?’ (Xuan Dewu et al. 1985: 57) 


Descriptions disagree in the number of forms and speech levels in Korean. Table 5.60 
shows these according to the analysis by Song (2005), who distinguishes six different 
levels. There are declarative, interrogative, imperative, and propositive endings. The suf- 
fixes are usually called “sentence enders”, because they always take the last position in a 
sentence and are not restricted to verbs as such, but can also attach to verbal adjectives. 
Consider the following examples from Jilin Korean. 


(165) Korean (Jilin) 


a. ka-numka? 
£0-Q.FAM 


‘Are (you) going?’ 
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b. k‘u-nka? 
big-Q.FAM 
‘Is (it) big?’ (Xuan Dewu et al. 1985: 31) 


Table 5.60: Korean sentence enders (Song 2005: 125) 


Statements Questions Commands Proposals 
Plain -(n)ta -ni/-(nu)nya -ela/-ala -ca 
Intimate -e/-a -e/-a -e/-a -e/-a 
Familiar -ney -na/-nunka -key -sey 
Semi-formal -0 -0 -(u)o -(u)psita 
Polite -eyo/-ayo -eyo/-ayo -eyo/-ayo -eyo/-ayo 
Deferential -(su)pnita -(su)pnikka -(u)sipsio -(u)sipsita 


Some of the sentence enders can be further analyzed. The first element in -n-unya, 
-n-un-ka, and maybe in -n-i, as well as the medial element in -sup-ni-kka may be an 
indicative marker. The suffix -sup is an addressee honorific while the suffix -un has been 
called a “pre-nominal-modifier” suffix. The polite forms are identical with the intimate 
forms except for an additional suffix -yo (Sohn 1994: 337ff.). In the Chungcheong dialect, 
often included into the Central dialect, but treated as a separate dialect by Sohn (1999: 
58), this takes the characteristic form -yu (L. Brown & Yeon 2015: 462). 

Some of the forms in Table 5.60 are not restricted to one function. In fact, of the in- 
terrogative forms mentioned, only the plain, familiar, and deferential forms are not also 
found in statements, commands, or proposals. Sohn (2015: 449) lists additional variants 
for plain statements (-la instead of -ta) and semi-formal (-(s)o/-(s)wu instead of only -o) 
questions. 

The sentence endings in the officially recognized variety of Korean spoken in China 
are very similar to standard Korean (Table 5.61). The authors mention additional forms 
not shown here such as -tfi or -tfio, which are found in all sentence types. These probably 
correspond to the committal -ci and its combination with the polite marker -ci-yo » -cyo 
in Standard Korean (see below). There are, furthermore, the endings -(num)t/i, -(num)ja, 
and -najo that are restricted to the interrogative sentence type. Their exact difference in 
meaning remains unclear. But these are clearly combinations of other elements already 
encountered. The element -ntun is known from the complex familiar interrogative ending 
-num-ka and -najo is the familiar interrogative ending -na in combination with the suffix 
-jo known from the polite speech level. The last elements in -num-tfi and -nun-ja are 
probably the marker -t/i seen before and the intimate marker -(j)2/-a, respectively, both 
of which are speech act neutral. 

As opposed to standard Korean -(u)si-psita, Chinese Korean -(uri)psita lacks the ele- 
ment -si that is present in -(u)si-psio/-(ur)si-psiyo and has been characterized as a “sub- 
ject honorific suffix" (Sohn 1994: 344). For Standard Korean Kim-Renaud (2012: 151) men- 
tions an additional set of so-called “superdeferentials”, the interrogative form of which 
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Table 5.61: Sentence enders in Korean as spoken in China (Xuan Dewu et al. 
1985: 62f.; Zhao Xi 1982: 75) listed analogous to Table 5.60 


Statements Questions Commands Proposals 
Plain (-nur)-(n)ta -ni -(j)ara/-ara -tfa 
Intimate -(j)a/-a -(j)a/-a -(j)a/-a -(j)a/-a 
Familiar -ne -na/-(num)ka -ke -se 
Semi-formal -(s)o -(s)o -(s)o -(ux)psita 
Polite -(j)ajo/-ajo -(j)ajo/-ajo -(j)ajo/-ajo -(j)ajo/-ajo 
Deferential -(sur)pnita -(sur)pnikka -(ur)sipsiyo -(ux)psita 


is -((u)si)naikka. According to her, the familiar interrogative forms (called “deferential 
equal”) are -(n)(u)nka(yo) and -((u)si)na(yo). In the latter form, both the honorific suffix 
-si and the polite marker -yo are optional, and the same is true for -(uJsi-psita/-(u)psita 
and other sentence enders. Variants with either the vowel e or a depend on the vowel 
in the preceding syllable. The variant with a follows syllables that contain an a or an 
0, otherwise the variant with e is employed. This is a special kind of restricted vowel 
harmony still present in Korean. Table 5.62 shows all attested standard Korean variants 
with the help of two verbs and two adjectives. 


Table 5.62: Interrogative paradigms of two verbs and two adjectives in Korean 
(Sohn 1994: 15-16) 


Plain 


Intimate 
Familiar 


Semi-formal 


Polite 
Deferential 


mek- ‘to eat’ 


mek-ni 
mek-nunya 
mek-e 

mek-na 
mek-nunka 
mek-so 
mek-uo 
mek-eyo 
mek-supnikka 


po- ‘to see’ 


po-ni 
po-nunya 
po-a 
po-na 
po-nunka 
po-o 


po-ayo 
po-pnikka 


coh ‘good’ 


coh-(u)ni 
coh-unya 
coh-a 
(coh-na) 
coh-unka 
coh-so 


coh-ayo 
coh-supnikka 


. E 3 
S1 sour 


si-ni si-nya 


si-e 
(si-ne) si-nka 


si-o 


si-eyo 
si-pnikka 


The use of the different speech levels is highly complex and has been very well summa- 
rized by Song (2005: 126f.), whose concise description is worth quoting in an abbreviated 


form. See Brown (2011) for details. 


The plain speech style is used between friends or siblings whose age difference is 
not substantial (perhaps a one or two year age gap; in Korean culture, a three or 
more year age difference is regarded as substantial), or by old speakers (e.g. parents 
or teachers) to young children. [...] 
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The intimate speech level is referred to as panmal ‘half talk’ in Korean. This level 
is similar to the plain level in that it is used between close friends and siblings 
(both before middle age), by young school children to adult family members (espe- 
cially their (grand)mother but probably not their (grand)father) or by a man to his 
(younger) wife. [...] 


The familiar speech level is used to someone who has a lower social status than the 
speaker. When this level is chosen, however, the speaker is signal[l]ing a reasonable 
amount of courtesy to the hearer. [...] it is typically used by male adults to younger 
male adults who are probably under the former’s influence (e.g. protégés or former 
students), or to their sons-in-law. [...] 


The semi-formal speech-level [...] has almost completely fallen into disuse and 
may indeed sound old-fashioned to young people’s ears. It is definitely a speech 
level associated with the older generation. If used, however, it is to someone with 
lower social status than the speaker and it is regarded as a slightly more courteous 
speech level than the familiar speech level. [...] 


The polite speech level, together with the intimate speech level, is the most com- 
monly used speech level, but, unlike the intimate speech level - which is emblem- 
atic of intimacy, familiarity or friendliness — it is used when politeness or courtesy 
is called for, regardless of the social status of the hearer, as long as they are old 
enough (university students and older). [...] 


Finally, the deferential speech level is the highest form of deference to the hearer. 
This speech level is thus used to people with unquestionable seniority. It is never 
used to someone with equal or inferior social status. [...] (my boldface) 


However complicated the internal division of question marking may be, it does not 
depend on the question type. The following content questions display the same question 
markers as did the polar questions above. Interrogatives remain in situ (Sohn 1999: 265) 
but nevertheless are often in sentence initial position. 


(166) 


(167) 


Korean (Jilin) 
muyas-ur ha-numka? 
what-Acc do-Q.FAM 


"What are (you) doing?’ (Xuan Dewu et al. 1985: 42) 


Korean 
mues-ul ha-ni? 
what-acc do-Q.PLAIN 


"What are (you) doing?' (Song 2005: 146)? 


Notice the slight dialectal differences such as the presence of an intervocalic consonant 
in Jilin Korean muyas as opposed to standard Korean mues (also cf. -(uz)si-psiyo versus 


Tn casual speech this sentence is said to be pronounced mwel hani. 


204 


5.7 Koreanic 


-(u)si-psio), as well as the difference in speech level. Alternative questions do not exhibit 
an obligatory disjunction. Instead, each alternative takes one of the interrogative sen- 
tence enders listed above. Naturally, the two markers have to be identical, i.e. are from 
the same speech level. 


(168) Korean (Jilin) 
kitf'a-ka mantfa o-r-ka, tfatontf a-ka o-r-ka? 
train-NOM first — arrive-PRS-Q.FAM Car-NOM arrive-PRS-Q.FAM 


‘Does the train or does the car arrive first?’ (Xuan Dewu et al. 1985: 94) 


(169) Korean 
wuli-ka ka-l-kka.yo salam-ul | ponay-l-kka.yo? 
We-NOM £0-PRS-Q.FAM person-ACC send-PRS-Q.FAM 
‘Shall we go or shall (we) send someone?’ (Sohn 1994: 122) 


In the latter example the same politeness marker -yo that we have already encountered 
in the polite level endings -a.yo - -e.yo is found in the Standard Korean example. 

There is an optional disjunction an-i-myen *NEG-COP-COND' that literally means and 
if not' (Sohn 1994: 20) and is thus a parallel to Mongolian eswel (85.8.2). 


(170) Korean 
yongho-ka te ^ khu-ni, animyen nami-kate ` khu-ni? 
PN-NOM more big-Q.PLAIN or PN-NOM more big-Q.PLAIN 


‘Is Yongho taller or Nami?’ (Sohn 1994: 20) 


Negative alternative questions may make use of a negative verb such as in the id- 
iomatic expression in (171). 


(171) Korean 
ka-I-kka ma-l-kka? 
£O-PRS-Q NEG-PRS-Q 
"whether to go or not' (Sohn 1999: 392) 


When the first alternative is a copula, the second alternative has to be the negative coun- 
terpart of a copula. 


(172) Korean 
canton iss-ni eps-ni? 
change COP-Q.PLAIN NEG-Q.PLAIN 
‘Do you have change or not?’ (Kim-Renaud 2012: 150) 


These are constructions very similar to those of surrounding languages such as Japonic, 
Mongolic, or Tungusic (see Chapter 6). 

Yoon (2010: 2783) investigated the relative frequency of question types. In this study 
there were 70% polar questions (including tag questions), 29% content questions and only 
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3% alternative questions. However, 15% of all polar questions were actually tag questions. 
The situation is thus very similar to Japanese (§5.6.2). Tag question markers usually have 
the form ku-ci-yo ‘do.so-coMM-POL’ (> kucyo) and are attached to a declarative sentence. 


Instead of completing a statement with a sentence ending and then adding a tag 
question such as ku-ci-yo, it is possible to put -ci or the contracted form of its 
negative form -canh [< -ci-anh] into the sentence ending of the main statement 
without using it in a separate tag question. Such a tag question marked in the 
sentence ending is called a “pseudo-tag question” by some researchers (Yoon 2010: 
2788, my brackets) 


The author has recorded the two examples in (173): 


(173) Korean 
a. hwuchwu | mac-cyo? 
black.pepper correct-CoMM.POL 
‘It is black pepper, right?’ 
b. wenlay khu-canh-a.yo? 
originally tall-CoMM.NEG-POL 


‘She has been tall since birth, right?’ (Yoon 2010: 2788) 


I was unable to find information on focus questions in the literature available to me. 
The following examples were elicited from a native speaker in South Korea via internet 
in April 2016. Focus was expressed in this case with word initial position of the focused 
element. The analysis roughly follows Song (2005). 


(174) Korean 

a. nayil hakkyo-ey ka-pnikka? 
tomorrow school-DIR g0-Q.DEF 
‘Do (you) go to school tomorrow?’ 

b. hakkyo-ey nayil ka-pnikka? 
school-p1r tomorrow go-Q.DEF 
‘Do (you) go to school tomorrow?’ 

c. tangsin-un nayil hakkyo ka-pnikka? 
2sc-TOP tomorrow school go-Q.DEF 


‘Do you go to school tomorrow?’ (elicited, slightly adjusted) 


Similar to Japanese, the question marker does not change its form and remains in sen- 
tence-final position. A topic marker -(n)un attaches to the focused pronoun in the last 
example that takes sentence initial position. The other sentences do not have an overt 
pronoun, as "Koreans tend to avoid second-person pronouns altogether" (Song 2005: 75). 
The second sentence differs from the first in the sentence initial position of hakkyo-ey 
(cf. Song 2005: 107). 
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Available descriptions of questions in Jeju are not very specific or detailed. Kiaer’s 
(2014: 13f.) otherwise good description only gives an unanalyzed list of 19 different inter- 
rogative endings: -ka(ko), -n’ga(go), -nya, -ne, -nda, -tia(ti), Je, cht -k’o/-llogo, -ra, men, 
-s6, -an, -sun/-mnekka, -ptega, -ptegang, -sugang, -sukkwa(gwa), and -suga(kka). Unfortu- 
nately, there is no information on the semantic or pragmatic differences between all 
these suffixes and it is doubtful that they all simply mark questions. One may only spec- 
ulate that they fall within different registers that are based on politeness. The interesting 
examples given by Kiaer (2014) lack a morpheme analysis and a glossing, which makes 
their analysis rather unclear. For instance, the three sentences in (175) were all translated 
as ‘Where are you going?" 


(175) Jeju 
a. Odi kamdi? 
b. ódüi kamini? 
c. ódi kamsini? 
"Where are you going?' (Kiaer 2014: 14, 16, 17) 


The interrogative ódi corresponds, of course, to Korean eti ‘where (to)’ and ka- in both 
languages means ‘to go’. The suffix m is considered to be a marker for the present 
tense but is better understood as an indicative marker (e.g., Saltzman 2014: passim). The 
analysis of -amsi as a marker for progressive aspect is equally problematic. The final -ni 
might be comparable to the plain question ending in Korean. But neither -ni nor -di are 
listed as an interrogative ending by Kiaer, who also leaves open the difference between 
ódi and óddii (maybe a typographic error). Sohn (1999) provides a more complete analysis 
ofJeju interrogative sentence enders, which is given in Table 5.63 below. Among these we 
find the two plain level question markers -(e)m-ti(ya) and -(e)m-sini, which correspond 
to -m-di and -m-sini in (175a, 175c), but no correspondence to -mini (175b) was found. 
Possibly, -mi-ni is the same ending as -(e)m-si-ni, but without the suffix -si. It may be 
noted that the expression "Where are you going?' is a common greeting in Korean that 
exists on different speech levels. In this expression the marker -si is optional on all speech 
levels, which corroborates the analysis of the Jeju ending as -mi-ni. 


(176) Korean 
a. eti — ka-si.pnikka? 
where go-Q.DEF 


b. eti — ka-pnikka? 
where go-Q.DEF 

c. eti — ka-s(i).eyo? 
where g0-Q.POL 

d. eti — ka-yo? 
where go-Q.Por 

e. eti — ka-si.o? 
where go-Q.sEMF 
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f. eti ka-o? 
where go-Q.sEMF 
g. eti — ka-si.na? 
where go-Q.FAM 
h. eti ka-na? 
where go-Q.FAM 
i eti ka-si.e? 
where go-Q.INT 
j eti ka-(a)? 
where go-(Q.INT) 
k. æti — ka-si.ni? 
where go-Q.PLAIN 
L eti ka-ni? 
where go-Q.PLAIN 


"Where are you going?’ (L. Brown 2011: 47; Iksop & Ramsey 2000: 264f.; 
Song 2005: 158; Yeon & Brown 2011: 8) 


Sohn (1999) includes the following Jeju example that corresponds functionally to the 
deferential speech level in the standard Korean example (176a) above. 


(177) Jeju 
etu ley ka-m-swu-kkwa? 
where to go-IND-AH-Q.DEF 


"Where are you going?' (Sohn 1999: 75, from Lee LS.) 


Saltzman (2014: 49) reanalyzed the sentence and calls ley (-re according to her) an abla- 
tive and -swu (-su in her rendering) a formal present tense marker, both of which are 
problematic. If ley indeed functions as an ablative, the sentence should rather have been 
translated as something like "Where do you come from?' In fact, according to Sohn (1999: 
75), Standard Korean may add the marker lo instead of ley. Clearly, this is the instrumen- 
tal or directional case marker (u)lo and not an ablative (Song 2005: 115). A comparable 
sentence from Jeju in the past tense given by Kiaer (2014) is the following: 


(178) Jeju 
ódi | ka-ng wa-m-su-gwa? 
where go-PST ?AUX-IND-AH-Q.DEF 


"Where did you go?' (Kiaer 2014: 10, my tentative analysis) 


Here the marker -m-su-gwa is the same as -m-su-kkwa in (177) above and corresponds 
to the standard Korean deferential interrogative -(su)p-ni-kka. Note that -sup (-p when 
following a vowel) is an addressee honorific suffix, -ni is an indicative marker and only 
-kka is the actual question marker (Sohn 1994: 341). Thus, phonological differences apart, 
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Jeju -m-swu-kkwa (-m-su-gwa) and standard Korean -sup-ni-kka contain the same func- 
tional elements but apparently use the addressee honorific suffix and the indicative 
marker in reversed order. 

Apart from Jeju, other dialects have special sentence enders as well. Table 5.63 sum- 
marizes those dialectal interrogative sentence enders that deviate from the standard lan- 
guage. Question marking in the Chungcheong dialect is very similar to Standard Korean, 
but -o, -e-yo, and -sup-ni-kka have the forms -wu, -e-yu, and -sup-ni-kkya instead, which 
exhibit slight phonological differences. Other endings such as -nya are identical: 


(179) 


Korean (Chungcheong) 


ni pap mek-ess-nya? 
2sc meal eat-PST-Q.PLAIN 


‘Did you have your meal?’ (Sohn 1999: 71) 


Table 5.63: Selected interrogative sentence enders in Korean dialects based on 
Sohn (1999: 66-76); some dialectal forms identical to standard forms were ex- 
cluded; see also Yeon (2012) 


PLAIN 


INT, FAM, SEMF 


POL, DEF 


Jeju 
Hamgyong 
Pyongan 
Jeolla 


Gyeongsang 


Standard 


-(e)m-ti(ya), 
-(e)m-sini, -esinya 
-wa, -m 


-(u)m-ma, [-(u)wa]?? 


-eya, -nya 

PQ -na, cop -ka 
CQ -no, cop -ko 
-ni, -(n)u-nya 


-m-kka, -m-kko, 
[-em-se], -(e)m-singa 
-wu, [-m-mey], 
[-cipi], -nungka, 
-m-twu(ng) 
[-(u)m-mey], -wu, 
-kan 


[-elawu] 
-neng-kyo 


[-e], -na, [-(u)o] 


-(e)m-swu-kkwa, 
(-wu)-kkwa 
-sswu-ta, -m-mengi, 
-p-syo, -p-m-mi-kka 


-(u)op-ni-kka, 
-(su)p-ney-kka, 
-(su)p-mey-kka 
-(su)p-ni-kkye 
(-(si)p)-ni-kk(y)e 


[-e.yo], -(su)p-ni-kka 


Square brackets in Table 5.63 indicate forms that are not restricted to questions. Some 
examples from the dialects follow. 


(180) 


Korean (Hamgyong) 
ka-wu? 
go-Q.FAM 


‘Does (she) go?’ (Sohn 1999: 67) 


20 The declarative form is -(u)wa-yo. 
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(181) Korean (Pyongan) 
etu.m ey ka-si-p-ney-kka? 
where to go-SH-AH-IND-Q 
"Where are you going?’ (Sohn 1999: 69) 


(182) Korean (Jeolla) 
ni pap muk-ess-nya? 
2sG meal eat-PST-Q.PLAIN 
‘Did you have your meal?’ (Sohn 1999: 74) 


Several examples from the 19th century, mostly based on the Pyongan dialect (King 1987: 
238), can be found in the Corean Primer by Ross (1877). For example, the ender -um-mé 
in (183) corresponds to -(u)m-mey in modern Hamgyong and Pyongan dialects. 


(183) Korean (Pyongan) 
moosoon bápi iss-um.mé? 
what meal coP-Q.FAM 


"What food is there?’ (Ross 1877: 13) 


For other dialects equally old materials are not available to me. 

There are differences in intonation as well. In Jeolla and Chungcheong both falling 
and rising intonation are possible, whereas the standard Korean equivalent necessarily 
has rising intonation. Polar questions in Gyeongsang generally have a falling intonation. 
See Sohn (1999: 66-76) and Jeon (2015) for additional information. 

Table 5.63 does not list forms encountered in Yukcin or Kolyemal. But some informa- 
tion on these dialects has been collected by Ross J. King. Instead of the standard Korean 
-(su)p-ni-kka, Yukcin has -mdung (King 1987: 238), which appears to have a cognate in 
Kolyemal -(i)mdo ~ -mdu (King 1987: 262). To my knowledge, no other Korean dialect 
mentioned thus far has a comparable form (Table 5.63). Kolyemal furthermore has -na, 
-o, and -ja, which correspond to Standard Korean -na, -o, and -nya, respectively. There 
are two polite markers, -ga and -ge that exhibit the same vowel difference as -a ~ -e in 
Standard Korean. But their exact etymology and function remain unclear to me. 


(184) Korean (Kolyemal) 

a. misi-f X ha-ja? 
what-Acc do-Q.PLAIN 
"What are you doing?' 

b. ka-mdo? 
£0-Q.DEF 
"Are you going?' 

c. ədi-ř ka-n.ga? 
where-Acc go-Q.Por 
"Where are you going?' (King 1987: 243, 262) 
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There is insufficient information on tag, focus, and alternative questions from the di- 
alects. But like Standard Korean, almost all dialects have the same marking in polar and 
content questions. Gyeongsang is exceptional among modern dialects in making a dis- 
tinction between polar -no and content questions -na. After copulas these markers take 
the forms -ko and -ka, but preserve the distinction between polar and content questions. 
This distinction cannot be found in more honorific speech levels. 


(185) Korean (Gyeongsang) 


a. 


ni etey ka-ss-no? 

2sG where go-PST-Q.PLAIN 

"Where did you go?' 

i ke nwu chayk i-ko? 

this ?Nom who book COP-Q.PLAIN 
"Whose book is this?' 

pap mun-na? 

meal eat-Q.PLAIN 

‘Did you eat?’ 

kuk i ni chayk i-ka? 

that Zwou 2sc book COP-Q.PLAIN 
‘Is that your book?’ (Sohn 1999: 72) 


This pattern is a relic from Middle Korean that was lost in the other dialects during 
the Pre-Modern Korean period (Table 5.64). More exactly, the Middle Korean marker 
-ko was replaced by -ka, which from then on marked both polar and content questions 
(Sohn 2015: 456). 


Table 5.64: Selected Pre-Modern Korean verb endings in the go century (Sohn 


2015: 456) 
Statements Questions Commands Proposals 

Plain -ta/-la, -eta, -ma -nja/-njo, -lja -la, -ala/ela -ca 
Intimate -ci -a 
Familiar -ney,-lsjej, -msjej —-nka/-nko -kej -sej 
Semi-formal  -(s)o -o, -lka -o, -kwulje 
Polite -(j)o 
Deferential ^ -ita,-olsita, -oyta ^ -iska,-pnójka/-pnika  -sjosje,-psio -saita, -psita 


Several of these sentence enders still encountered in 19th century Korean are no longer 
in use in modern Standard Korean, e.g. the semi-formal interrogative ending -lka. 

The difference between polar and content questions was still present in Middle Korean, 
which also had a further question marker -ta that was later lost. A good description of 
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Middle Korean question marking and its relation to Contemporary Korean (CK) was 
recently given by Sohn (2015: 448). 


The interrogative endings were (a) -(k)o/-sko, (b) -(k)a/-ska, and (c) -ta. -(s)ko oc- 
cur[r]ed in question-word question sentences, and -(s)ka in yes-no questions. Both 
-(k)o and -(k)a also attached directly to a copula complement, as in i-nón sang-ka 
pel-a? (CK i-nun sang i-nka pel-inka?) ‘Is this a prize or a punishment?’ After the 
mood suffixes -ni [indicative] and -li [prospective], the endings -ko and -ka lost the 
consonant -k, and became new question endings -nio/-njo and -nia/-nja/-nje on the 
one hand and -lio/-Ijo and -lia/-lja/-lje on the other (CK -n[j]a/-ni; -lya). The ques- 
tion ender -ta, which is obsolete in CK, was frequently used in a sentence whose 
subject is a second person, as in kutuj-nón enu cek-uj tolao-l-ta? (CK kutay-nun 
encey tolao-keyss-eyo?) "When will you return?' The three-way (a, b, c) distinction 
has been lost in CK, except that the Gyeonsang dialect retains the -ko/-ka distinc- 
tion. (slightly corrected) 


Consider the following examples that illustrate the markers -ka, -ko, and -ta, respectively. 


(186) Middle Korean 


a. i twusalóm i  cinsillo nej hangkes-ka? 
this two person NoM truly 2sG.GEN master-Q 


‘Are these two persons truly your masters?’ 
b. hjenljang-ón sto mjes salóm-ko? 
wise.person-roP also how.many person-Q 
‘Also, how many wise people were there?’ 
c. kutujnón enu cek-uj tolao-l-ta? 
2sc-roP which time-Loc return-?PROS-Q 
‘When will you return?’ (Sohn 2012: 102, 103) 


As can be seen from the example given in the above quotation, alternative questions 
take two polar question markers. For a better understanding, the example is analyzed in 
more detail in (187). 


(187) Middle Korean 
i-nón  sang-ka pel-a? 
this-roP prize-Q punishment-Q 


‘Is this a prize or a punishment?’ (Sohn 2012: 102) 


The complete set of Middle Korean sentence enders is given in Table 5.65. As in modern 
Korean, there are four different sentence types, but only four speech levels. 

Old Korean had two interrogative sentence enders -ku 4, JÈ, B and -ka E, too, but 
both marked polar questions (Nam 2012: 58f.). The distinction between polar (-ka 7) and 


content (-ko dr, -s.ko.a WEB) question markers was only introduced in Late Old Korean 
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(Nam 2012: 66). The Old Korean question markers display a form similar to Tungusic and 
Mongolic on the one side (Old Korean -ku) and to Japonic on the other (Old Korean -ka) 
(§§5.10.2, 5.8.2, 5.6.2). Question marking in the Jurchenic branch of Tungusic strongly 
differs from the other branches. There are more and different question markers and all 
have forms similar to Koreanic (Table 5.66). 


Table 5.65: Middle Korean verb endings (Sohn 2015: 449) 


Statements Questions Commands Proposals 
Plain -ta/-la -ko/-ka, -nje/-njo, -ta — -(ke)la -cje (-cela) 
Neutral -ni/-noj -ni -kola/kolje ? 
Moderate respect -ng-ta -nó-ni-ska/sko -esje/-asje ? 
Deferential -ngi-ta -nó/ni-ngi-ska/sko -sjosje -sa-ngi-ta 


Table 5.66: Similar question markers in Middle Korean and Jurchenic (85.10.3) 


Middle Korean Manchu 

-ni -ni 

-nia/-nio -nio, ?Bala =n9 
-(k)a/-(k)o =o, Alchuka =(k)o 
?-nja; ?Korean -na; ?Gyeongsang -nA -nA 


The exact source and time of borrowing remain unclear. But since Classical Manchu 
already had all markers, they were borrowed before 1600. A major difference is that 
question markers replace declarative endings in Koreanic but usually attach to them in 
Jurchenic (note, however, forms such as Bala ana=no ‘go=Q’). Manchu =o usually seems 
to follow copulas (free or bound), which also speaks in favor of a connection with Korean. 
Remember that the Gyeongsang dialect has the form -ka ~ -ko following copulas, and 
Alchuka =ko preserves a velar plosive in this form as well. For instance, Manchu -mbi=o 
‘-IPFV=Q’ (containing the copula bi) exactly corresponds to Alchuka -mei=ko. Similar to 
Korean sentence enders, Jurchenic markers may also attach to non-verbal elements but 
remain in sentence-final position, e.g. Bala amin-ro 'father-Q'". Korean -ois not restricted 
to questions but may also mark imperatives, for instance, and Manchu also has a polite 
imperative marker -rAo that may contain the same element, possibly attached to the im- 
perfective participle -rA that also appears in the prohibitive ume V-rA. But more research 
with the help of large scale corpora is necessary to determine the exact meaning and use 
of those markers in Manchu. 
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5.7.3 Interrogatives in Koreanic 


Korean interrogatives exhibit two dominant resonances, e- and m-. The first has previ- 
ously been compared with Old Japanese (85.6.3). Similar to several other surrounding 
languages, the interrogative ‘who’ does not belong to any of these groups but rather 
starts with n-. In the Chungcheong dialect the resonance e- has the form we- instead 
(King 2006a: 267). Table 5.67 summarizes those interrogatives found in the literature 
available to me for Standard Korean, Korean as spoken in Jilin as well as Jeju. 

Sohn (1999: 69) mentions a Pyongan interrogative verb ekha 'to do how' that he ren- 
ders as the periphrastic sequence etheh-key ha- in standard Korean (ha- ‘to do’). Jeju 
has a periphrastic sequence atan-ha, too (Saltzman 2014: 65). The interrogative meaning 
‘who’ is an amalgamation of the original interrogative with the content question marker 
(Sohn 2015: 456). Note that nwuku still has the nominative form nwu-ka in Standard Ko- 
rean. The combination enu-cey ‘which time’ is the source of the contracted form encey 
‘when’ (Sohn 1999: 262). 


Table 5.67: Interrogatives from Korean (Sohn 1999: 208ff., 256, 273, 396, 403; 
Yoon 2010: 2784, in square brackets), Korean spoken in China (Xuan Dewu 
et al. 1985: 29, 161), and Jeju (Kiaer 2014; Cheng & Harrison 2014, in square 
brackets; Saltzman 2014, in parentheses) 


South Korea Jilin Jeju 
who nwu(ku) nwu-ka NOM  nuku [nuge] 
what mues muyes  musin'gó ~ musigó, [musinggeo] 
what + noun musun 
what + noun myech mjotf' 
(time) 
which enu onul ónügó, {ani} 
what kind of etten 
how ecci, ettehkey {atan} 
where eti ati ódi, {ati} 
when encey antfe 
how much elma arma 
how long [elmana] 
why way 


what sort of weyn 


There are also several forms meaning ‘who’ that are a combination of enu- with one of 
the three bound nouns ay ‘child’, salam ‘person’, and pun ‘respected person’ (Sohn 1999: 
207f., passim; Song 2005: 73, passim). The interrogative musun is derived from mues-i-n 
*what-coP-REL' and etten from e-tte-ha-n *which-kind-cop-nEr' (Sohn 1999: 256). Korean 
weyn similarly derives from way-i-n. A form without the relative marker -n but with 
the adverbializer -key ‘so that, to’ is probably the source of etteh-key ‘how’ (cf. Sohn 
1999: 376). In these forms e- seems to be the actual interrogative marker that must also 
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be the ultimate source of elma, ecci, enu, and eti. In Jeju the interrogatives musi(n’)-g6 
‘what’ (Korean mues) and ónü-gó ‘which’ (Korean enu) contain a suffix -gó that could 
correspond to Korean kes ‘thing’ or -kes-i ‘-thing-Nom’, which is regularly pronounced 
-key in informal speech (Song 2005: 155). This assumption is corroborated with data from 
Kolyemal, among which we find misi-ge ‘what thing’. That in Kolyemal ondze-ge ‘when’ 
(Korean encey) the same suffix is present is unlikely from a functional perspective. Ta- 
ble 5.68 summarizes Kolyemal interrogatives and their direct Korean cognates. 


Table 5.68: Kolyemal interrogatives in comparison with Korean (King 1987: 263; 


Sohn 1999) 
Kolyemal Korean 
who nugi nwu(ku) 
who (Acc/DIR) nugi-f nwukwu-l(ul) 
how many me(t) myech 
what (thing) misi-ge mues 
what (ACC/DIR) misi-f mues-ul (> mwe-l) 
what kind of musun musun 
where odi-mee eti 
whither (acc/DIR) odi-i eti-lo 
whence (ABL) odi-so 7eti-eyse 
when (thing?) ondze-ge encey 
why otftfze ecci 
how attokhze ettehkey 


The resonance e~ in Korean has the form o~ in Kolyemal. The case suffix -/ combines 
the function of a directive with that of an accusative, as can be seen in nugi-ř ^what-Acc' 
but ədi-ř ^where-prR'. In Korean both the accusative -(I)ul and the instrumental -(u)lo also 
have the function of a directive, but the first is likely the source of Kolyemal -* (Song 
2005: 112, 115). Kolyemal odi-mz, like Pyongan etu-m in example (19) above, derives from 
Middle Korean etu-mej (see below). 

Similar to Japanese (85.6.3), Korean displays parallel paradigms in demonstratives and 
one interrogative stem. Like Japanese (ko-, so-, and a-, older ka-), Korean has a three way 
distinction of demonstratives (i, ku, and ce). But while Japanese has exactly the same 
paradigms for the interrogative stem do-, the paradigm of Korean e- exhibits several 
irregularities (Table 5.69). 

The paradigms not only contain case endings but also certain bound nouns that have 
typological and probably areal parallels in Manchu (85.10.3). Unlike the adverbs yeki 
‘here’, keki ‘there’, and yeki ‘over there’, which are based on the demonstrative stems in 
combination with eki ‘place’, the interrogative eti ‘where’ has a case marker -ti. In Jeju, 
this suffix can also be found in the demonstratives (Table 5.70). 

As regards the irregular Jeju stem jo-, note that Korean also has the diminutive demon- 
strative stems yo, ko, and co (Sohn 1994: 114). 
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Table 5.69: Full paradigms of Korean demonstratives and the selective interrog- 
ative (Sohn 1994: 296) 


PROX PROX.H DIST which 
stem i ku ce e-nu 
eki ‘place’ yeki keki ceki eti 
kes ‘thing’ i kes ku kes ce kes enu kes 
il ‘thing, fact’ iil ku il ce il enu il 
kos ‘place’ ikos ku kos ce kos enu kos 
direction iccok ku ccok ce ccok enu ccok 
way i-li ku-li ce-li eti-lo 
kind of i-le-n ku-le-n ce-le-n e-tte-n 
to be ... way i-leh-ta ku-leh-ta ce-leh-ta e-tteh-ta 
to do ... way i-le-n-ta ku-le-n-ta ce-le-n-ta - 


Table 5.70: Jeju demonstratives and the selective interrogative in neutral and 
locative form (Saltzman 2014: 21) 


PROX PROX.H DIST which 
selective i~jo/ja ki tea A-ni 
locative jo-ti ki-ti tea-ti A-ti 


The interrogative enu ‘which’ is likely analyzable and based on the stem e-. The ending 
-nu might, according to Vovin (2005: 322), have a connection to a Japanese attributive 
ending (Old Japanese -nó). While the Jeju interrogative Ati ‘where’ can, at least synchron- 
ically, be analyzed as 4-ti ‘which-Loc’, this is probably not true for Korean eti. Diachron- 
ically, however, both Jeju Ati and Korean eti go back to Middle Korean e-tuj, the second 
part of which is a bound noun meaning ‘place’. Vovin (2005: 322) assumes that the form 
can be reconstructed as Proto-Korean(ic) *èntúy, thus allowing an analysis of the first 
part as the forerunner of Korean enu ‘which’ and a connection with Proto-Japonic *entu 
‘where’. His reasoning is based on the fact that the t should have regularly changed to l 
in this position without the n present. Middle Korean furthermore has an extended form 
etu-mej ‘where’ that might be comparable to Old Ryükyüan idu-ma (85.6.3). 

In general, the set of Middle Korean interrogatives is very similar to modern Korean, 
only one form (hjen ‘how many’) having been entirely lost (Table 5.71). The exact differ- 
ences between the forms meaning ‘what’ remain unclear to me. 

Vovin (2005: 319) mentions an additional Middle Korean form e:styé ~ e:sté ~ e:styéy 
‘how’ that, according to him, goes back to “e-is-ti 'how-exist-ADv". 
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Table 5.71: Middle Korean interrogatives (Sohn 2015: 98) in comparison with 
modern Korean 


Middle Korean Korean 
who nwu(-ko ~ -kwu) nwuku ~ nwu- 
how many mjes myech 
what musus, muzus, musu, musuk, musum mues, musun 
where etuj eti 
where etu-mej etu-m (Pyongan) 
whither etu-le eti-lo 
which enu, enwu, enó enu 
how much enma elma 
when enu-cjej encey 
how many hjen - 


5.8 Mongolic 


5.8.1 Classification of Mongolic 


Mongolic languages form a language family with about a dozen modern members. Ac- 
cording to Janhunen (2006: 232) they may be classified as in (5.4). Rybatzki (2003a: 388- 
389) assumes a slightly different classification with six groups. Of these, the so-called 
Northern (Khamnigan Mongol, Buryat) and South-Central groups (Shira Yughur) are 
part of Central Mongolic and Shirongolic, respectively, in Janhunen’s (2006) classifica- 
tion. 

The two classifications agree, however, in the number of languages as well as in some 
details such as the isolated positions of Dagur and Moghol. A recently discovered lan- 
guage that was added to (5.4) is called Kangjia and belongs to the Shirongolic branch. 
According to Kim (2003: 347), “the Kangjia ‘language’ would appear to be intermediate 
between Bonan and Santa.” However, it may actually be more closely related to Bonan 
(Siqinchaoketu 2002: 66). Central Mongolic has also been called Common Mongolic by 
Janhunen (2012b: 3f.), and is said to contain also the Khorchin group of dialects that was 
not listed as a separate entry. Khorchin is spoken in western parts of Manchuria (the 
modern provinces of Heilongjiang, Jilin, and Liaoning), but mostly in the adjacent parts 
of eastern Inner Mongolia (Janhunen 2012b: 4). Chakhar Mongolian, not listed above, be- 
longs to the same branch as Khalkha. It is spoken in Inner Mongolia and is said to be the 
language spoken by the descendants of the last emperor of the Mongolian Yuan dynasty 
and his followers who fled from Peking in 1368 (Sechenbaatar 2003: 1). Kalmyk, also not 
mentioned, can be considered an aberrant dialect of Oirat and is the only Mongolic lan- 
guage located in Europe. Moghol, located in Afghanistan, is probably extinct today and 
will for the most part be excluded here. 
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The Mongolic language Shira Yughur or Eastern Yughur (dongbü yügü vg 7k bia 


— Central Mongolic 4 


m Dagur(ic) — Dagur 


— Khamnigan Mongol 


Buryat 
— Mongol proper 


H Ordos 


—— Shirongolic + 


Oirat 


-—— Shira Yughur 


Huzhu Mongghul 


— Minhe Mangghuer 


Bonan 
Kangjia 


Santa 


——— Moghol(ic) — Moghol 


Figure 5.4: Classification of Mongolic 


E 


if 


in Chinese) should not be confused with the Turkic language Yellow Uyghur that is also 
called Sarig or Western Yughur (xibü yùgù yù Vi Apta lB in Chinese, see $5.11). There 


are also different Chinese designations for Bonan (bdoan yù D 


TZW), Santa (dongxiang 


yù Æ £ 18), and a collective name for Huzhu Mongghul and Minhe Mangghuer (tăzú yù 


ŻY), which are also known as Monguor in the West. Of the languages mentioned in 


(5.4) only Moghol is located outside Northeast Asia. All Mongolic languages except for 
Buryat and Kalmyk, which are for the most part spoken in Russia as well as Moghol in 
Afghanistan, are located within Mongolia and China. 
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5.8.2 Question marking in Mongolic 


Question marking in Mongolic is not very complex. Janhunen (2003d: 27) gives a good 
summary of the marking of questions in Proto-Mongolic. 


When no interrogative pronoun or pronominal verb was present in the sentence, 
interrogation in Proto-Mongolic was expressed by a sentence-final interrogative 
particle, which may be reconstructed as either *gü (> *=gU), as in Buryat and 
Khamnigan Mongol, or *xU (> *-UU), as in most other Mongolic languages. In 
questions containing an interrogative word, no particle was originally needed, but 
in Common Mongolic the copular form *bii-(y)i > *büi ‘being, present’ was gram- 
maticalized in such sentences into what may be termed a corrogative particle. 


Consider the following examples from Written Mongolian in which the forms are still 
relatively well preserved. 


(188) Written Mongolian 
a. ta sayin-uu? 
2PL good-Q 
"Are you well?' 
b. ta ken bui? 
2PL who coP»Q 


"Who are you?' (Janhunen 2003f: 53, transcription changed) 


The marking of polar questions with a sentence-final clitic is, of course, an areal trait. 
The development of a marker in content questions, on the other hand, sets Mongolic 
apart from most languages of the area. But similarly there exists a special content ques- 
tion marker in some Turkic languages that shares a functional background in a copula 
(85.11.2 and 86). 

In modern Mongolic languages the question markers have gone through phonetic 
erosion. As we will see further below the polar question marker fused with certain verb 
endings and copulas, especially in Shirongolic languages. Some individual Mongolic lan- 
guages have additionally adopted question markers from other languages. Consider the 
following examples from a variety of Buryat spoken in China. 


(189) Buryat (Shineken) 
a. xugs/in-tei oolz-aa=g=sla? 
old.woman-coM meet-P.IPFV=Q=2SG 
‘Did you meet the old woman?’ 
b. xen-tei oolz-aa=b=s‘a? 
who-coM meet-P.IPFV=Q=2SG 


‘Whom did you meet?’ 
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c. sii nam-tai os-no-ba? 
2sG 1SG.OBL-COM reach-PRS=Q 
"You go with me, don’t you?’ (Yamakoshi 2011a: 170-171) 


Sentence (1892) illustrates the polar question marker =go ~ =gu - =g, sentence (189b) the 
optional “corrogative” particle =be ~ =b, and sentence (189c) the marker =ba, which is a 
recent borrowing from Chinese ba ll. that can be found in several languages of China (86, 
and $5.9.2.1). In Shineken Buryat -ba is mutually exclusive with the agreement marker. 
Alternative questions display double marking with =go - =gu ~ =g. 


(190) Buryat (Shineken) 
bii enee-g-uur-ee ` jab-xa-g-bi, ^ teree-g-uur-ee jab-xa=g=bi? 
1sc.NOM this-E-INST-REFL g0-P.FUT=Q=1sG that-E-INST-REFL go-P.FUT-Q-1sG 
"Should I go in this or in that direction?' (Yamakoshi 2006: 153) 


In non-verbal sentences the content question marker can also attach to word classes 
such as adjectives and interrogatives. 


(191) Buryat (Shineken) 
a. slinii — xubuun-sle alin-be? 
2sG.GEN boy.NoM-2sG.Poss which=Q 
"Where is your boy?' (Yamakoshi 2011b: 116, shortened) 
b. al/an-in hain-be? 
which-3sc.Poss good=Q 
"Which one is good?' (Yamakoshi 2007b: 5) 


Janhunen (2003d) appears to believe that the question marker in Buryat and Kham- 
nigan Mongol has a different origin than the one found in other Mongolic languages. 
Interestingly, both Buryat and Khamnigan Mongol had intense contact with dialects of 
the Tungusic language Evenki. In both Khamnigan Evenki and Khamnigan Mongol the 
enclitic has the form -gv. Janhunen (1991: 95) speculated that it may have been bor- 
rowed from one language to the other, but left the direction of borrowing open. Given 
that many Tungusic languages preserve a cognate of the enclitic in Khamnigan Evenki 
(see §5.10.2), it seems likely that it was borrowed from Evenki into Khamnigan Mongol. 
But Khamnigan Evenki may reflect influence from Khamnigan Mongol, and in turn has 
lost the property of consonant alternation that is still present in Evenki proper (=gu ~ 
=ku ~ =nu ~ =vu). The enclitic =gi(i) in the Tungusic language Solon, on the other hand, 
is probably a secondary loan from a Mongolic source (possibly Buryat =gii). Apart from 
the not unlikely scenario that individual Mongolic languages have borrowed the Tungu- 
sic question marker, the other Mongolic question marker reconstructed by Janhunen as 
*xU, could potentially also have a very old connection to Proto-Tungusic "Ku because 
it already existed at the proto-level of both language families. As is often the case, the 
etymology of the markers is not transparent in either Mongolic or Tungusic. Also note 
a similar marker -ku (written as i. 3&. W0) in Old Korean (85.7.2). 
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The form of the question marker in Middle Mongol was probably =UU, that is =üü 
~ =uu. In Written Mongol (Uyghur script), the enclitic has the form -(ju)gu ~ -(ju)qu ~ 
-(ju)qhu when following vowels and -ugu ~ -uqu ~ -uqhu otherwise (Rybatzki 2003b: 79). 
According to Street (2008/09: 45), the plosives were not present in the spoken language 
but rather indicated a hiatus, which can be seen from other scripts used to write Middle 
Mongol. The vowel harmony may represent a problem for the comparison with Tungusic, 
but the older records of Middle Mongol show a strong functional similarity to Tungusic. 
While the enclitic has a strict sentence-final position in modern Mongolian, it was mobile 
at earlier stages and could attach to a focused element. In other words, the functional 
scope included not only polar but also focus questions. 


The interrogative particle in early Middle Mongolian was what may be termed a 
floating particle: for purposes of emphasis it could float from one point to another 
on the surface structure of a sentence, though at a deeper level remaining in con- 
struction with the remainder of the sentence as a whole [i.e., marking the whole 
sentence as question]. (Street 2008/09: 76, my square brackets) 


A typological parallel for a change from a mobile to a sentence-final question particle can 
be observed in the transition from Old to Modern Japanese (85.6.2). In Middle Mongolian 
alternative questions were also marked with the same enclitic that attached once on each 
alternative. Consider the following examples from Middle Mongol. 


(192) Middle Mongol (Arabic script; Secret History) 
a. burut-b-uu? 
escape-TERM-Q 
‘Did (you) escape?’ 
b. caq-u'u giir-be? 
time-Q arrive-TERM 
‘Has the time arrived?’ 
c. jób-ü'ü tab-u'u ügüle-rün? 
appropriate-Q ?convenient-Q say-CVB 


‘Saying: is it appropriate, is it convenient?’ (Rybatzki 2003b: 79) 


The same functional scope can be reconstructed for Proto-Tungusic (§5.10.2). Further- 
more, the two proto-languages combine this with a similar phonological shape, which 
is unlikely to be a coincidence. 

As indicated by Janhunen in the above quotation, the etymology of the marker "but 
is transparent and has its origin in a participle form of the copula *bü-, most likely the 
so-called deductive *-(y)i *PRs.1PFV' (Janhunen 2003d: 24). The term "corrogative" is fre- 
quently employed by Janhunen but has never been explained adequately from a func- 
tional perspective or in terms of grammaticalization. According to the analysis followed 
in this book, it may simply be called a content question marker. While in Mongolian it 
has an eroded form similar to Buryat, it may also appear in a form that is still identical 
to the copula. 
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(193) Mongolian 
cii xedzee yab-sen bwai? 
2sG when depart-Psr cop>Q 


"When did you go?' (Janhunen 2012b: 255) 


As noted above, a similar content question marker exists in some surrounding Turkic lan- 
guages that has its origin in a copula that in turn goes back to a demonstrative (§5.11.2). 

Dagur differs from other Mongolic languages in that there is a different polar question 
marker. There is no, or at least no obligatory, content question marker. 


(194) Dagur 
a. en bitig-yee? 
this book=Q 
‘Is this a book?’ 
b. fii ani-fi? 
2sG who-2sG 
‘Who are you?’ (Tsumagari 2003: 150; Chaolu Wu 1994b: 11) 


The data by Zhong Suchun (1982), collected in 1963 in Morin Daba, show a similar 
situation but make it clear that the polar question marker can be used optionally in 
content questions, too. This indicates that it is not only formally, but also functionally 


different from other Mongolic languages. Apparently, Dagur also has borrowed Chinese 
ba DÉI 


(195) Dagur (Morin Daba) 

a. ana warkal-ii waa-san=jaa? 
this clothes-Acc wash-PsT=Q 
‘Have (you) washed these clothes?’ 

b. xaana it[-boi-fii-jao? 
whither go-FUT-2sG=Q 
"Where are you going?’ 

c. dang  00-d3 ul  bol-sar baa? 
tobacco drink-CVB.IPFV NEG can-PST Q 


‘Smoking is prohibited here, right?’ (Zhong Suchun 1982: 76) 


There is one example of an alternative question that exhibits the marker jumoo once 
on each alternative. This is probably a recent loan from an Inner Mongolian dialect, in 
which the latter part is the question marker -UU, that will be further explained below. 
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(196) Dagur 
bii aido2 ^ jaw-oos-min? dzuyi-yu jum.oo?tiidaa ^ jaw-oos-min! 
1sc this.way go-cvB.coND-1sc right-IPFV Q that.way go-CVB.COND-1sG 
d;uyi-yu jum.oo? 
right-iPFv Q 
"Will I go this way or that way?' (Chaolu Wu 1994b: 18) 


In the Dagur dialect spoken in Tarbagatai (táchéng #43) in Xinjiang, the usual polar 
question marker has the vowel-harmonic forms -ja - -ja - -jo and also marks alternative 
as well as content questions. It remains unclear whether it can also be found in focus 
questions. The marker was given as a suffix but is reanalyzed as an enclitic here. 


(197) Dagur (Tacheng) 
a. argun-Sin ` xajir-san=ja? 
husband-2sc come.back-Psr-o 
‘Did your husband come back?’ 


b. saos-sin xərəə, baraan=ja ocog=jo? 
urine-2sc how.much much=g _little=Q 
‘How much urine do you have? Is it a lot or a little?’ 
c. Sii xajaa and ir-san-si=ja? 
2sc when here come-PsT-2sG-Q 
"When did you come here?' (Yu Wonsoo et al. 2008: 85, 86) 


The functional scope of the question marker in Dagur suggests an areal connection to 
several surrounding languages (86). 

In some content questions there is a copula that could be the “corrogative” form found 
in other Mongolic languages. As in Shineken Buryat the agreement marker follows the 
copula, but in Dagur the sentence additionally takes the usual question marker, which 
makes it unlikely that the copula fulfills the role of a question marker. 


(198) Dagur (Tacheng) 
Sii xaan-aar ir-san-b-Si=ja? 
2sG where-ABL Come-PST-COP-2SG=Q 
‘Where did you come from?’ (Yu Wonsoo et al. 2008: 86) 


According to Yu Wonsoo et al. (2008: 79), the marker -ja sometimes fuses with the pre- 
ceding suffix and the verb in example (198) is realized as /irzbisa/. If the element =ja that 
is sometimes found in questions in the Tungusic languages Sibe and Aihui Manchu is 
indeed a question marker, then its most likely source is Dagur. Clearly, Dagur was also 
the origin of the question marker Jee in Oroqen (see §5.10.2). 

To my knowledge there are no explicit descriptions of questions in Moghol, but Weiers 
(1972) mentions several examples of polar and content questions that appear to be gen- 
erally unmarked morphosyntactically. Presumably, there was a different intonation con- 
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tour that cannot be reconstructed for now. Given its peripheral position outside of North- 
east Asia, Moghol will not be further addressed here. 

The Northern subgroup of Mongolic as identified by Rybatzki (2003a), i.e. Khamnigan 
Mongol and Buryat, basically share the question marking of Shineken Buryat seen above. 
Both the polar question marker as well as the "corrogative" particle are still present in 
both languages. Khamnigan also has adopted the Mandarin marker ba IE. The Kham- 
nigan Mongol “corrogative” particle bei has been borrowed into Khamnigan Evenki 
(§5.10.2). 


(199) Khamnigan Mongol 

a. hain=gu? 
good=Q 
‘Is it good?’ 

b. tere ken bei? 
3sc who Q 
"Who is he?' (Janhunen 2003b: 97) 

c. ana kobcaxon-cini ^ torg-oor ajo-gd-20=gu, bisi-gu? 
this clothes-2scG.Poss silk-INST sew-PASS-P.IPFV=Q NEG=Q 
‘Are these clothes made of silk or not?’ 

d. bii kara-ku-du- min tabi-tee-ta ba? 
1SG.NOM see-P.FUT-DAT-1sG.POSS fifty-PROP-2PL Q 
‘I guess you are about fifty years old, right?’ (Yamakoshi 20072: 132, 127) 


As in Shineken Buryat, an agreement suffix may follow the question markers in Stan- 
dard Buryat. The marker =gii marks polar, alternative, and maybe focus questions. 


(200) Buryat 
a. yeshe münge-tei  hen-gü? 
PN  money-COM COP.PST-Q 
‘Did Yeshe have money?’ 
b. shi xen-tei-b-shi? 
2sc who-cOM=Q=2sG 
"Who are you with?' 
c. yaba-xa-m=gii, bai-xa-m-gü? 
go-P.FUT-1SG-Q stay-P.FUT-1sG=Q 
‘Shall I go or stay?’ (Skribnik 2003: 120, 119) 
This is probably also true for Khamnigan Mongol, but no example for a plain alternative 
question has been found in the relevant literature. 


In order to compensate for the lack of information in most grammatical descriptions, 
the following examples of Cyrillic Khalkha Mongolian were elicited in October 2015 
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from a Mongolian informant of Outer Mongolia living in Germany. The analysis and 
transcription partly follows Janhunen (2012b). As noted before, polar questions are usu- 
ally marked with the enclitic =UU. 


(201) Cyrillic Khalkha Mongolian 
ci surguul-ruu-g.aa yaw-j bai-g.aa ` youm-uu? 
2sG school-prR-POSS.REFL depart-CVB.IPFV COP-P.IPFV COP=Q 


"Are you going to school?’ 


As in this example (201), the enclitic sometimes combines with a copula, derived from 
a word meaning ‘thing’ (Janhunen 2012b: 221, 228). This form also appears in Dagur as 
=jumoo and some Tungusic languages (see §5.10.2), all of which were probably borrowed 
from central Mongolian dialects spoken in Inner Mongolia. It also seems likely that the 
polar question marker found its way from Mongolian (=uu ~ =00) into Oroqen (=00), 
where it has an additional meaning of fear or doubt. Focus questions are identical to polar 
questions in form but exhibit an additional intonational peak on the focused element 
(indicated by underlining in Mongolian and with italics in the translation). Unlike Middle 
Mongol and some Tungusic languages, the question marker does not express focus itself 
and cannot take any other position in the sentence. 


(202) Cyrillic Khalkha Mongolian 


ci surguul-ruu-g.aa yaw-j bai-g.aa ` youm-uu? 
2sG school-prR-POSS.REFL depart-CVB.IPFV COP-P.IPFV COP-Q 


"Are you going to school?’ 
Both plain and negative alternative questions require two question markers as well 
as a disjunctive. The disjunctive eswel literally meaning '(and) if not’ could be analyzed 


as es-wel 'NEG-CVB.COND' and can also be employed as a standard disjunctive (Janhunen 
2012b: 221). This has a typological parallel in Korean an-i-myen ‘NEG-COP-COND’ (85.7.2). 


(203) Cyrillic Khalkha Mongolian 
a. ci tzai uu-x-uu, eswel airag uu-x-uu? 
2sc tea drink-P.FUT-Q or kumis drink-p.FUT=Q 
‘Do you drink tea or kumis?' 
b. ci surguul-ruu-g.aa yaw-a.x=uu ` eswel yaw-a.x-güi-y.uu? 
2sG school-piR-POss.REFL depart-P.FUT=Q or ^ depart-P.FUT-NEG-Q 
‘Do you go to school or not?’ 


Alternative questions may also take the extended question marker youm-uu (Ben- 
jamin Brosig p.c. 2016). 


(204) Cyrillic Khalkha Mongolian 
ci tzai uu-x youm=uu, eswel airag uu-x youm-uu? 
2sc tea drink-P.FuT COP=Q Or kumis drink-P.FUT COP=Q 


‘Do you drink tea or kumis?' 
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Apparently, the question marker =UU has expanded its scope and sometimes also 
appears in content questions. 


(205) Cyrillic Khalkha Mongolian 
ci xedzee surguul-ruu-g.aa yaw-a.x-uu? 
2sG when  school-prR-Poss.nEFL depart-P.FUT=Q 


"When are you going to school?’ 
But according to other sources, Khalkha also has the expected "corrogative" particle. 


(206) Cyrillic Khalkha Mongolian 
xen tsai uu-san be? 
who tea drink-P.PFv Q 


‘Who drank tea?’ (Svantesson 2003: 171) 


Some verbal endings in Mongolian have a slightly different but predictable form in 
the interrogative than those in the declarative. These are summarized in Table 5.72. 


Table 5.72: Special interrogative endings in Mongolian according to Janhunen 
(2012b: 183f., 255, 298)); differences are marked with boldface 


Meaning Declarative Interrogative 
Mood VOL yab-ii.y yab-y-oo 
Finite DUR yab-e.n' yab-n-oo 
CONF yab-l(=aa) EMPH yab-l=00/yab-laa=y.oo 
TERM yab-eb yab-b-oo 
RES yab-j yab-j=oo 
Participle FUT yab-ex yab-x-oo 
HAB yab-deg yab-dg-oo 
PRF yab-sen yab-sn-oo 
IMPRF yab-aa yab-aa-y.oo 


Similarly to other languages of the region, descriptions of Mongolic languages usually 
do not mention tag questions and it remains open whether they are absent or were simply 
ignored. The following elicited example is marked with a marker tee that appears to be 
ultimately derived from the distal demonstrative te- and can roughly be translated as ‘is 
it like this?". 


(207) Cyrillic Khalkha Mongolian 
ci surguul ruu-g.aa yab-na tee? 
2sG school DIR-POSS.REFL depart-DUR so 


"You are going to school, right?' 
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Another tag question type encountered in Mongolian consists of a negative copula 
followed by a polar question marker. 


(208) Darkhat Mongolian 
ir-sen biš=00? 
come-P.PFV NEG=Q 
‘(S)he arrived, didn’t (s)he? (Ragagnin 2011: 188) 


Descriptions of Mongolic languages usually also do not mention intonation contours. 
But Karlsson (2003: 192) made the following interesting observations for Khalkha Mon- 
golian. 


Focus in questions is signaled by a rising gesture, the LH [low high]. However, 
depending on the segmental conditions, the gesture can be realized just as a tonal 
peak, synchronized with the second mora, making it similar to the focal H in declar- 
atives. Interrogatives have a terminal low boundary tone, which is characteristic 
for most informants, while the high final rise is optional. All this makes the into- 
nation of interrogatives similar to that of declaratives. The reason for this seems to 
be the strong formal signaling of interrogatives by using question particles. Thus, 
intonation has a redundant role in forming the interrogative mode in Mongolian. 
(my square brackets) 


We may thus conclude the following: polar questions are obligatorily marked with the 
enclitic =UU and have an optional rising intonation. In focus questions there is an ad- 
ditional peak on the focused element. This makes the structure of focus questions quite 
different from Middle Mongol, where, as seen above, the enclitic attaches to the element 
in focus. In addition, interrogatives in content questions obligatorily receive “the same 
tonal gesture” as focused elements in focus questions (Svantesson et al. 2005: 93). 

In Chakhar Mongolian the polar question marker is -UU (24d, =uu) or -y UU when 
following a vowel. According to Sechenbaatar (2003: 182) "the material shape of the in- 
terrogative particle links Chakhar with Khalkha, marking a distinction with regard to 
several other Inner Mongolian dialects, such as, for instance, Baarin, in which the inter- 
rogative particle appears in the invariable shape =ii. The optional “corrogative” particle 
has the form =w ~ =b or =wéé - =béé. The forms with a plosive are found following the 
nasals m or y. 


(209) Chakhar Mongolian 
a. ax-cin ónóóodór yaw-n-uu? 
brother=2sc today depart-puR=Q 
‘Is your brother leaving today?’ 
b. en yamar yum-béé? 
this what thing=Q 
"What kind of thing is this?' (Sechenbaatar 2003: 182, 107) 
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Khorchin Mongolian likewise has the enclitic =(j)UU that marks polar, alternative, 
and possibly focus questions. 


(210) Khorchin Mongolian 
a. ana bol nom-oo? 
this Top book=Q 
‘Is this a book?’ 
b. bii anuur jab-aal taar-n-oo, tiis-aa jab-aal taar-n-o0? 
1sG here go-cvB.cowD fit-prs=Q there-REFL go-CVB.COND fit-PRs-Q 
‘Do I have to go in this or that direction?’ (Yamakoshi 2015: 282, 292) 


There is also an enclitic =(j)ii that marks polar questions as well as, optionally, content 
questions. This might indicate an areal connection to Ainu, Dagur, Korean, Japanese, 
Manchu, Ogami, and Ulcha (§6). Perhaps, the expansion of Khalkha =(y)UU can also be 
explained as an areal trait connected to this. 


(211) Khorchin Mongolian 

a. ter un  jeb-lee=j.ii? 
that person depart-coNF=Q 
"Did that man go?' (Chaganhada 1991: 71) 

b. éii xon-ji? 
2PL who=Q 
"Who are you?' 

c. Cinii — aab-Cin' xaa bii? 
2sG.GEN father-2sc.Poss where CoP 
"Where is your father?' (Yamakoshi 2015: 282, 284) 


However, Khorchin might also exhibit the “corrogative” marker. Compare the follow- 
ing two examples from Khorchin and Khalkha, respectively (Benjamin Brosig p.c. 2018). 


(212) Khorchin Mongolian 
an texaa  xon-ee jimee? 
this chicken who-GEN CoP.Q 


"Whose chicken is this?’ (Chaganhada 1991: 71) 


(213) Cyrillic Khalkha Mongolian 
en taxyaa xen-ii(-x) youm bwai? 
this chicken who-GEN(-NOM) coP Q 


"Whose chicken is this?’ 


Without the nominalizer, youm is perhaps better understood as ‘thing’. 


228 


5.8 Mongolic 


According to Brosig (2014: 15), Khorchin has two further question markers =me and 
=mu. Their exact scope and etymological relation remain unclear. However, =me appar- 
ently can mark polar and content questions while =mu appears at least in polar and 
alternative questions, e.g. nogon=mu, xar=mu? 'Is (it) green or black?’ (Brosig 2014: 15). 


(214) Khorchin Mongolian 
zaqi-d yuu xii-me? 
PN-DAT what do=Q 


"What are you going to do in Jarud?"?! (Brosig 2014: 15) 


An imperfective marker -n is said to have been assimilated to the following question 
marker in this example. Khorchin also has borrowed the Mandarin marker ba IE. Accord- 
ing to Chaganhada (1991: 72) it has a long vowel (baa), just like the adjacent languages 
Solon, Oroqen, and Dagur. 


(215) Khorchin Mongolian 
Cinii ^ ax=Cin’ bas dué bol-oodue-ba? 
2sG.GEN e.brother-2sc.Poss also forty become-NEG.IPFV=Q 
"Your elder brother is not yet forty, right?' (Yamakoshi 2015: 287) 


An authochthonous equivalent of Mandarin ba IE used mostly by older speakers is 
the combination =i=dee (Brosig 2014: 16). In tag questions either ba or =(y)UU may be 
employed, which has parallels in the Tungusic language Sibe (85.10.2) and in Mandarin 
(85.9.2). 


(216) Khorchin Mongolian 


a. tərə xun bax$ _ bisa, tiim=ba? 
that person teacher NEG like.that=Q 
‘That person is not a teacher, right?’ 
b. xadan čag bol-j-ji? xajor éag-oo? tiim-uu? 
when hour become-cvB.iPFv-Q two hour=Q like.that=Q 
"What time is it? Two o'clock? Right?' (Yamakoshi 2015: 283, 286) 


Benjamin Brosig (p.c. 2016) mentions a couple of additional particles such as qi (identi- 
cal to fii below) of not absolutely clear origin. It is perhaps best classified as a tag question 
marker. Mongolian has a negative copula, bish in the spoken and bous in the literary lan- 
guage, that might somehow be connected to a word meaning ‘other’ (Janhunen 2003d: 
27; Janhunen 2012b: 251). There is a parallel grammaticalization of adjectives meaning 
‘different’ to a negative copula in Tungusic (Hólzl 2015a: 146). According to Chaganhada 
(1991) it has the form bifii in Khorchin and has developed into a question marker. Under 
my analysis, however, the final =ii might be nothing but the question marker. From this 
perspective, bif=ii is probably a tag question marker almost identical to Darkhat bis-oo 


to Chinese this place is called zāqí H. D. 
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and bish=uu in Mongolian according to Janhunen (2012b: 251). This construction has ex- 
act typological parallels in several Tungusic (§5.10.2) and Turkic languages (§5.11.2). In 
what appears to be another type of tag question, Khorchin bif may also be followed 
by an emphatic enclitic (bif=j.aa) that has the form =(y)AA in Mongolian according to 
Janhunen (2012b: 93). Perhaps the form fii is a contracted form of bif=ii. 


(217) Khorchin Mongolian 
an udur fin tabon sar-iin nogan fii? 
this day new five month-cEN one Q 
‘Is today not the first day of May?’ (Chaganhada 1991: 71) 


The same marker fii can also be found in tag questions following the element tii.n, which 
is probably derived from the distal demonstrative (cf. Janhunen 2012b: 130), similar to 
(207) from Khalka. Chaganhada (1991: 72) translates tiin fii, which may be attached to 
a declarative sentence, as a tag question. In Khalkha there is also a question tag tiim 
bish-üü (Benjamin Brosig p.c. 2018). 

There are few clear descriptions for questions in Ordos. But there is evidence that 
it preserves the original question marker as =(j)uu and lacks the “corrogative” particle 
(Stefan Georg p.c. 2015). 


(218) Ordos 
a. yabu-b=uu? 
gO-TERM=Q 
‘Did he go?’ (Georg 2003b: 208) 
b. tere juru ge-DZe-n? 
3sG what say-RES-3 
"What does he say?' (Mostaert 1937: lix) 


Most other dialects will be ignored here for lack of data and reasons of space. 

There are also few good materials for questions in Oirat, which is why there will first 
be a descriptions of questions in the closely related language (or aberrant dialect) Kalmyk. 
In Kalmyk the interrogative particle -u marks polar questions and similar to Mongolian 
(Table 5.72) fuses with some suffixes, e.g. -na ‘DUR’ vs. -nU ‘DUR.Q’ and -la ‘CONF’ vs. - 
lu coup o (Benzing 1985: 42). The “corrogative” particle is preserved as =b ~ =w, e.g., 
kem=b? ‘who is it?’. In addition, there is another question marker -iy ~ =i that seems to 
be employed in alternative questions as well as polar questions, e.g. xol-iy? ‘Is it far?’. 


(219) Kalmyk 

a. ter ir-v-u? 
3sc come-PFV=Q 
‘Did he come?’ 

b. endr yamaran 6dr? sän | ódr-iy mu ódr-iy? 
today which day good day=Q bad day-o 
"What kind of day is it today, (is it) a good day or a bad day?' (Benzing 1985: 
42f.) 
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Note that in this example a content question is followed by an alternative question (see 
§4.4). Blasing (2003) does not mention the marker, but quite clearly, this is the same ele- 
ment we have already seen above, e.g. Khorchin =(j)ii. The following Kalmyk sentences 
were elicited from a native speaker living in Germany in January 2016 via internet. The 
transliteration and analysis are mine but roughly follow Blasing (2003). 


(220) Kalmyk 
a. al’daran yow-jana-c? 
whither go-PROG-2sG 
"Where are you going?’ 
b. ci manhdur shkol-d ^ yow-jana-c? 
2sG tomorrow school-ALL go-PROG-2sG 


"Are you going to school tomorrow?' 


c. ci manhdur shkol-d ^ yow-jana-c? 
2sG tomorrow school-ALL go-PROG-2sG 


"Are you going to school tomorrow?' 


No question marker appears in content questions. In focus questions the focus is appar- 
ently expressed with the help of an additional peak on the focused element. 

The situation in Kalmyk is indeed very similar to Oirat proper, for which Birtalan 
(2003) mentions the question markers =UU ~ =(y)UU, - ii, as well as =w ~ =b. 


(221) Oirat 
sään báüün-uu? 
good cop=Q 


‘Are you well?’ (Birtalan 2003: 227) 


As opposed to Benzing, she treats the form - ii, which we have already encountered in 
Baarin, Khorchin, and Kalmyk, as a variant of the polar question marker. In fact, this is 
the most likely analysis as its form - ii ~ =y.iiis completely parallel to the standard marker 
=UU ~ =y.UU (Janhunen 2012b: 183). In some Tungusic languages there are question 
markers that were probably borrowed from Mongolian =(y)ii, notably Ongkor Solon -ii 
as well as, less likely due to geographical distance, Even -ii, Negidal -i, and maybe Uilta 
-(y)i (8510.2). 

Shirongolic languages also preserve the original polar question marker, but display a 
more complicated picture than Central Mongolic. In Shira Yughur—classified by Rybatzki 
(20032) as the only South-Central language instead of as Shirongolic—the polar question 
marker -uu ~ =j.uu sometimes fused with the preceding verb ending, but there is no 
clear information as to when and how often this happened. The durative marker -nAi 
(and variants) always has the form -nam before the question particle. The “corrogative” 
particle appears to be optional. Alternative as well as negative alternative questions take 
two question markers. 
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(222) Shira Yughur 


a. tfa qutad larga f[da-dac-uu? 

2sG PN speak able-HAB-Q 

‘Can you speak Chinese?’ 
b. tfamiin dza jimar duge-da »ol-son bo? 

2sG.GEN be what time-DAT get-PST Q 

"Which year were you born in?’ (Chaolu Wu 1994d: 8, 9f.) 
c. tfa.ma-da — qudaxra bii-j.u, uyui-j.u? 

2sG.OBL-LOC knife EX=Q  NEG-Q 

‘Do you have a knife?’ (Zhaonasitu 1981a: 58) 


The last example (222c) is a negative alternative question that shows a negative existen- 
tial because of the existential in the first alternative. Similar to the situation in Khalkha 
before, there is also one example of the polar question marker in what appears to be a 
content question (cf. Mongolian -x-oo in Table 5.72 above). 


(223) Shira Yughur 
cimiin keen-di óg-k'uu? 
28G.ACC who-DAT give-FUT.Q 
‘To whom shall I give you?’ (Nugteren 2003: 280) 


In sum, Shira Yughur interrogative constructions pattern with Central Mongolic and 
have to be differentiated from the more complex system found in Shirongolic languages. 

Bonan, like Ordos, lacks the “corrogative” particle in content questions, which are 
morphsyntactically unmarked. 


(224) Bonan 
dzoma yala o0-to? 
PN ` where go-PFV 
"Where did Droma go?’ (Fried 2010: 261) 


For polar questions Bonan preserves the Mongolic interrogative marker that has the 
form -u. But its use is more complicated than in these Mongolic languages we have 
encountered before: "When -u is suffixed to imperfective lexical verbs, it replaces the 
imperfective suffix (-tci/-tco). Similarly, when it is suffixed to perfective verbs, it replaces 
the perfective suffixes -to and Tea (Fried 2010: 258) The suffix thus attaches directly to 
the verb stem. 


(225) Bonan 
tc'o nuda. nat'o-u? 
2sc today dance-Q 


"Did you dance today?' (Fried 2010: 259) 
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Table 5.73: Special interrogative forms in Bonan (Hugjiltu 2003: 339, 343) 


Meaning Declarative Interrogative 
NARR PRS, FUT -m -mu 
DUR PRS, FUT -na -nu 
TERM PST -wa > -0 -wu > -u 


Table 5.74: Special interrogative copula forms in Bonan (Hugjiltu 2003: 340, 


343) 
Subjective Objective Interrogative 
EXIST wi wa wu 
COP.EMPH mbi mba mbu 


The interrogative marker -u fused with several verb endings and copulas, see Ta- 
ble 5.73 and Table 5.74. 


(226) Bonan 


a. ode-nu? 
£O-DUR.Q 
"Will he go?' (Hugjiltu 2003: 343) 
b. tcin-da tca dawzi weu? 
2sc-DAT tea leaf  rxisT.Q 
‘Do you have tea?’ 


c. tə.rə aago da tema mbeu? 
that girl also student cop.EMPH.Q 


‘Is that woman also a student?’ (Chaolu Wu 1994a: 8, 6) 


The copula forms are given as weu and mbeu by Chaolu Wu (1994a). According to 
Fried (2010: 260), the forms are declarative wi SUBJ vs. wa OBJ, interrogative wu(u) SUBJ 
vs. wa-u OBJ and declarative bi suBy vs. ba OBJ, interrogative bu su8j vs. ba-u opt The 
copula starting with b- is used in nominal copula clauses, the copula starting with w- 
in all other clause types (Fried 2010: 260). In addition, the Gansu variety of Bonan has 
borrowed the Chinese polar question marker mal (Hugjiltu 2003: 343) and has a special 
marker -si, allegedly for "rhetorical questions", that can also mark alternative questions. 


(227) Bonan 


a. yecan jangatca nat*ə k'ar-si? 
PN how dance be.required-o 


“How should (one) dance Leru?' 
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b. pa [hkuto  ora-si asa ora-si] əsə med-o. 
1sc yesterday rain-Q NEG rain-Q NEG know-TERM 


'I don’t know [whether it rained or not yesterday]. (Fried 2010: 99, 227) 


An example of a plain alternative question was given by Chaolu Wu (228). Interest- 
ingly, only the first alternative has an overt question marker. A similar situation can be 
seen in Santa, Kangjia, and Mangghuer and is an areal feature. 


(228) Bonan 
ba en-sa o.d ka-san=wu, ta)-sa o.d kar-san? 
1sc this-ABL go be.required-».Prv-o that-aBL go be.required-».Prv 


"Will I go this way or that way ?' (Chaolu Wu 1994a: 15) 


In Bonan there is also the Chinese question marker ba IE. 


(229) Bonan 
dedo ‘guda sa edaro ba? 
old.man.voc yesterday NEG ?tired o 


‘Grandpa, you weren't too tired yesterday, right?’ (Buhe & Liuzhaoxiong 1982: 
59) 


In Kangjia the question marker has the form -# and has the two variants up and - 
bu. It fused with more suffixes than in Bonan resulting in the forms listed in Table 5.75. 
In addition there are also two markers ba, le, and sa that are most likely of Mandarin 
I 


Chinese origin (e.g., Mandarin ba ll, Xining le” WE, Hezhou za, see §5.9.2.1) As in Bonan, 
content questions are usually unmarked. 


Table 5.75: Special question forms in Kangjia (Siqinchaoketu 1999: passim; 2002: 


passim) 
Meaning Declarative Interrogative 
NFUT -m, -mu 
NFUT -na, ... -nu 
PST -va, -pa, -ba -vu 
PROGR -si, -sina -su, -sunu 
PROGR -dzi(na) -dzina 
PROGR (-d3i igu >) -dzigu -dzigu u 
PFV -sun -sun u/bu/vu 
IPFV -gu(n) -gu(n) u/bu 
HAB (-si-gu ») -s(u)gu -s(u)gu bu 
FUT (-gui » -gi) -gu(n)-i/a -gu(n) bu 
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(230) Kangjia 

a. tfi meda-na ba? 
2sc know-NFUT Q 
"You know, right?’ 

b. ajo! en.de ma-ge-d3i re-va sa? 
EXCL here how-v-CVvB.IPFV come-PST Q 
‘Oh! Why have you come here?’ 

c. tfi-ni kun le? 
2SG-GEN person Q 
‘What about your person?’ (Siqinchaoketu 1999: 215, 222, 217) 


Alternative questions have only one marker attached to the first alternative. 


(231) Kangjia 
a. tfi mede-nu? 
2sc know-NFUT.Q 
‘Do you know?’ 
b. te tfi-ni-gu bei 
that 2sG-GEN-?N Q 
‘Is that yours?’ 


c. re-vu, se re-va? 
come-PST.Q NEG come-PST 


‘Has (she) come or not?’ (Siqinchaoketu 2002: 71, 169, 217) 


Tag questions in Kangjia take the sentence-final marker ere ~ are. Note a formally 
similar tag question marker ale in the Turkic language Tuvan (§5.11.2). 


(232) Kangjia 
te kun lausu mari, ere? 
that person teacher NEG. Q 


‘That person isn't a teacher, right?’ (Siqinchaoketu 1999: 197) 
Kangjia has a copula system similar to Bonan (Table 5.76). 


Table 5.76: Special interrogative copula forms in Kangjia in analogy to Bonan 
(Kangjia/Bonan) (Siqinchaoketu 1999: 196f., 216, passim; 2002: passim). 


Subjective Objective Interrogative 
EXIST i wi va/wa vu/wu 
COP.EMPH mbi/mbi mba/mba mbu/mbu 
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Similar to Bonan and Kangjia, Santa preserves the Mongolic interrogative marker as 
-u, which also fused with the preceding verbal ending or copula (Table 5.77). There like- 
wise does not appear to be a "corrogative" particle. 


Table 5.77: Finite tense aspect markers in Santa (Kim 2003: 358; Napoli 2014: 
39); in Chaolu Wu (1994c) the interrogative forms are given as -nu and -wo-u 


Meaning Declarative Interrogative 
DUR -IPFV -ne -nu 
TERM -PFV -wo -wu 
PROGR -CVB.IPFV COP -zhi wo > -zho -zhi wu > -zhu 


(233) Santa 
a. ire-nu? 
come-DUR.Q 
"Will you come?’ 
b. kijie-zhi wu? 
sleep-CVB.IPFV COP.Q 
‘Is (s)he sleeping?’ 
c. chi khala echi-ne? 
2sc whither go-DUR 
"Where are you going?' (Kim 2003: 358) 
d. ghoni-chini ^ ire-wu? 
sheep-2sG.GEN come-TERM.Q 


‘Have your sheep come back?’ (Todaeva 1959: 284, modified transcription 
based on Kim 2003) 


The form -mu found in the following alternative question (234) was not mentioned by 
Kim but is probably comparable to an identical form in Bonan, the so-called narrative 
interrogative. In Santa as well, only the first alternative receives a question marker. 


(234) Santa 
bi ana man-sa jawu-mu ha man-sa jawu-na? 
1sc this direction-ABL go-Q that direction-ABL go-DUR 


"Will I go this way or that way?' (Chaolu Wu 1994c: 14) 


Todaeva (1959: 295), who did fieldwork among the Santa in the middle of the 50s, 


mentions two additional interrogative particles la and ba. The latter is clearly of Chinese 
I 


origin (ba I, Liu Zhaoxiong 1981: 83). 
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(235) Santa 


a. hhe shi noghi we-la? 
that cop dog cop-?Q 


‘Is that a dog?’ 


b. e.ne kun ede ire-ne ba? 
this man here come-DUR Q 


‘This man is coming, right?’ (Todaeva 1959: 295, modified transcription 
based on Kim 2003) 


Perhaps la is a loan from Hezhou Mandarin la? W (§5.9.2.1). Field (1997: 360) claims 
that Santa has tag questions that have the form of a regular polar question followed by 
the irrealis negator ulia, which is a very unexpected construction for a tag question. In 
fact, an analysis as a negative alternative question in which only the first alternative is 
overtly marked is more likely. Such a situation can also be found in Karlong Mongghul 
(see Faehndrich 2007: 221) and Minhe Mangghuer (see below). 


(236) Santa 
imani madzia=nu ulia? 
faith know=Q NEG 


‘Do you know the faith or not?’ (Field 1997: 360) 


The same construction is also mentioned by Liu Zhaoxiong (1981: 79). A slightly different 
analysis of the use of negators for question marking has recently been given by Napoli 
(2014: 41). According to him, there are two negators that can fulfill this function. 


Events marked with the non-perfective marker -ne can only receive the negator 
(u)lie. Since this negator can only be used with events marked with this marker, the 
finite marker can be dropped. This is not the case with wuye, since it can negate 
events marked with -wo and -zho. Therefore, in order to specify the tense-aspect 
of the sentence, the marker is obligatory. 


(237) Santa 
chi baza-de echi-wo wuye? 
2SG PN-LOC gO-TERM NEG 


"Did you go to Linxia or not?' (Napoli 2014: 41) 
However, other sources do not mention the negator wuye at all. Santa has a partially 


productive negative verb ui- (Liu Zhaoxiong 1981: 73; cf. S. Kim 2003: 362) that could be 
the basis for wuye. For instance, consider the following example. 


(238) Santa 
tsu-ni ada  uai-nu, u-wo? 
2sG-GEN father COP-Q | NEG- TERM 


‘Is your father alive or not?’ (Liu Zhaoxiong 1981: 105, simplified) 
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Mongghul has no *corrogative" particle in content questions (Stefan Georg p.c. 2015), 
but preserves the polar question marker. Faehndrich (2007) has collected several descrip- 
tions of question marking in Mongghul that exhibit certain differences but usually agree 
in the presence of three question markers such as neutral uu, nuu after objective -a, and 
juu after subjective -ii in Karlong. The descriptions disagree about the analysis of the 
question markers as particles or suffixes. Here, the original variant has been analyzed as 
enclitic =(y)uu and all other forms as suffixes. In Karlong Mongghul the forms are 


nu:, after words ending in the objective suffix -a, ju; after words ending in the 
subjective suffix -i:, and u; which is used after words ending in other vowels, in- 
cluding /a/ which is not the objective suffix. Short high vowels are deleted before 
the interrogative particle u:. (Faehndrich 2007: 221) 


In example (239b) it appears in a focus question and does not stand sentence-finally. 
It does not, however, attach to the apparent focus in the sentence, which is ege pisee 
‘big belt’. The situation is thus unlike Middle Mongol. Whether the focus position is 
sentence initial or preverbal remains unclear, but might be responsible for the sentence- 
final position of the personal pronoun. But see also §5.11.2 on Turkic languages for second 
person markers following the question marker. 


(239) Mongghul (Karlong) 
a. tci dziehun-la-uu? 
2SG marry-v-o 
"Are you married? 
b. ege pisee sge-dz-a-nuu tei? 
big belt see-PERF-OBJ=Q 2SG 
‘Have you seen a big belt?’ (Faehndrich 2007: 221, 220) 


(240) Mongghul 
qi anji xji-gu-i? 
2sG where go-P.FUT-SUBJ 


"Where are you going?' (Georg 2003a: 303, shortened) 


Akerman (2012) gives a much more elaborate description of the interrogative verb 
forms in Mongghul. Similar to other Shirongolic languages, the question marker fused 
with several verb suffixes and copulas (Table 5.78). There is complex interaction of ques- 
tion marking with the domains of tense, aspect, clause type, and perspective. Similar 
to Faehndrich's (2007) claim, the question marker -nu only appears after the objective 
forms. However, the interaction of question markers with the other suffixes is much 
more complicated. In some cases the question marker simply fused with the suffix, e.g. 
-wuu < -wa + -u. In other cases, those in which Faehndrich (2007) postulated the question 
marker -juu, the analysis is somewhat unclear. 

The question suffix -niu, for example, might be analyzable as -ni-u, but following 
Faehndrich an analysis as -n-iu is more likely. Only in some cases is there a question 
marker with a long vowel, e.g. -m-uu. 
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Table 5.78: Special interrogative suffixes and copulas in Mongghul (Akerman 


2012: 13ff.) 
Meaning Declarative Interrogative 
PFV SUBJ -wa -wuu 
IPFV SUBJ -nii -niu 
STATE SUBJ -jii -jiu 
FUT SUBJ -gu.nii, -gui -gu.niu, -guu 
IPFV NEUT -ni/-nu -nu 
IPFV NEUT -m/-n -muu 
FUT NEUT -m/-n -muu 
COP NEUT wei wei-u 
PFV OBJ -jia -jia-nu 
IPFV OBJ -na -na-nu 
FUT OBJ -gu.na -gu.na-nu 
COP OBJ wa-ma-na-la-ra  wa-nu- ma-nu ~ na-nu ~ la-u ~ ra-nu 


(241 Mongghul 


a. gi Mongghul uguo mudie-nu? 

2SG PN language know-1PrFv.Q 

‘Do you know (how to speak) Mongghul?' 
b. qi wazar Minyuen-de sou-niu? 

2sG city | PN-LOC Sit-SUBJ.IPFV.Q 

‘Do you live at Minyuan in the city?’ 
c. qi laosi ` wei-u? 

2sc teacher sUBJ.COP-Q 

"Are you a teacher?' 


d. nie-nu moxi shda-gu.na-nu? 
this-acc read can-OBJ.FUT-Q 


‘Can you read this?’ 


e. ngan Zhonggui kun na-nu? 
3SG PN person OBJ.COP-Q 


‘Is he Chinese?’ (Akerman 2012: 14) 


(242) Mongghul 


qi niudur xi-gu.niu, malang — xi-gu.niu? 
2sG today gO-FUT.SUBJ.Q tomorrow g0-FUT.SUBJ.Q 


‘Do you go today or tomorrow?’ (Akerman 2012: 14) 


Alternative questions take two question markers that have to be identical in form. 
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Probably the most aberrant Mongolic language with respect to the marking of ques- 
tions is Mangghuer. Instead of a simple particle there is a rather elaborate paradigm of 
forms which, as in Mongghul, includes the dimension of perspective (Table 5.79), typical 
for adjacent Tibetic languages (§5.9.2). Nevertheless, the suffixes marking polar ques- 
tions clearly contain the original interrogative particle. 


Table 5.79: Paradigm of question marking in Mangghuer (Slater 2003b: 316; 
Dixon 2012: 386f.) 


PFV IPFV FUT 
DECL or CQ SUBJ -ba -la bi -ni 

OBJ -jiang -lang -kunang 
PQ SUBJ -bu -la bi-u -nu 

OBJ -jinu -leinu -kuninu 


(243) Minhe Mangghuer 

a. bi ri-jinu? 
1SG come-PFV.OBJ.Q 
‘Did I come?’ 

b. qi ri-bu? 
2SG come-PFV.SUBJ.Q 
‘Did you come?’ 

c. gan ri-jinu? 
3sG come-PFV.OBJ.Q 
"Did (s)he come?' (Slater 2003a: 198) 


There is one example of a negative alternative question in which only the first alterna- 
tive receives question marking while the second takes a negative marker. In the original, 
nu was written detached from the preceding word. 


(244) Minhe Mangghuer 
ta ghula qijige kerli-nu lai-kerli? 
2PLtwo flower want=Q NEG-want 


‘Do you want two flowers or not?’ (Zhu Yongzhong et al. 1997: 437) 


As expected, there are also special interrogative forms of copulas, as shown in Ta- 
ble 5.80. Again, the original question marker can clearly be recognized, but is not com- 
pletely analyzable. 
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Table 5.80: Special interrogative copulas in Mangghuer (Slater 2003b: 318); 
Slater (2003a: 199) in addition has the variant meinu, which is identical to beinu 
in meaning; negative copulas in addition have special attributive forms, SUBJ 
(u)gui and oy (u)guang, while there are no such special forms for declarative 
and interrogative copulas 


Declarative Question Negative 
SUBJ bi biu puzhi 
OBJ bang beinu puzhang 


(245) Minhe Mangghuer 


a. bi laoshi meinu? 
1sc teacher COP.OBJ.Q 


‘Am Ia teacher?’ 

b. qi laoshi biu? 
2sc teacher COP.SUBJ.Q 
"Are you a teacher?' 


c. gan laoshi meinu? 
3sc teacher COP.OBJ.Q 


‘Ts (s)he a teacher?’ (Slater 2003a: 199) 
Content questions do not take the interrogative forms of copulas. 


(246) Minhe Mangghuer 
tasi ang=ji-ku-ni bi? 
2PL where-DIR-IPFV-N COP.SUBJ 


"Where are (all of) you from?' (Chen Zhaojun et al. 2005: 16) 


For the Halchighul dialect of Mangghuer Zhaonasitu (1981b: 61) mentions the markers 
ba (Chinese ba IE) and sa (perhaps Hezhou za?) with similar meanings. 

At a first glance the situation in Mongghul is very different from the other languages 
mentioned thus far, but this is partly due to the difference in description. In fact, Mong- 
ghul has a strikingly similar system that is given again in Table 5.81, following the analy- 
sis by Slater (2003b: 316) and Dixon (2012: 386f.) for Mangghuer. In fact, this new analysis 
allows us to analyze some of the forms further than would be possible otherwise. The 
so-called ^why-question" marker -ji in Mangghuer not shown in Table 5.81 might corre- 
spond to -jii 'STATE.SUBJ in Mongghul. 

The perspective neutral forms were left aside to make the system more comparable 
with Mangghuer. The two paradigms show both striking similarities and differences. Al- 
together, the interaction between the domains of perspective, aspect, and tense is almost 
identical. In general, however, the Mongghul forms are more readily analyzable. There 
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Table 5.81: Paradigm of question marking in Mongghul (Akerman 2012: 13ff.) 
in comparison with Mangghuer (Mongghul/Mangghuer) 


PFV IPFV FUT 
DECL or CQ ` sont -wa/-ba -nii/-la bi -gu-nii/-ni 
OBJ -jia/-jiang -na/-lang -gu-na/-kunang 
PQ SUBJ -wuu/-bu -niu/-la bi-u — -gu-niu/-nu 
OBJ -jia-nu/-jinu  -na-nu/-leinu ` -gu-na-nu/-kuninu 


are slight phonological changes as can be seen from correspondences such as Mang- 
ghuer -ba and Monggul -wa. Mangghuer has apparently innovative imperfective forms 
that are a combination of the copula bi, interrogative bi-u (Mongghul neutral copula wei, 
wei-u), and a so-called “imperfective auxiliary linker" -la (Slater 2003a: 143) that might 
correspond to the verbal purposive suffix -/a in Mongghul often found before auxiliaries 
(Akerman 2012: passim). The unexpected objective imperfective forms -lang and -leinu, 
corresponding to Mongghul -na and -na-nu, have been contaminated by -la but are pre- 
served in the future. In Mongghul the future forms are still identical to the imperfective 
forms, except for the future participle marker -gu (Georg 2003a: 300). In Mangghuer - 
ku is restricted to the objective forms. This parallel allows an at least historically valid 
analysis of the two Mangghuer forms into -ku-nang (Mongghul -gu-na) and -ku-ni-nu 
(Mongghul -gu-na-nu). 

In sum, for most Mongolic languages the information given for question marking in 
grammatical descriptions is not sufficient for a full typology. Table 5.82 summarizes the 
different interrogative marking strategies in Mongolic languages for polar and content 
questions, exclusively. From Table 5.82 it becomes apparent that the internal diversity 
of interrogative particles within Mongolic is less pronounced than, for example, Tun- 
gusic. In fact, Mongolic languages may be classified into four groups according to their 
polar question markers. Most languages preserve the original polar question marker 
-UU. Dagur has the form -yee instead, Moghol apparently lacks any morphosyntactic 
question marker, and Buryat, together with Khamnigan Mongol, probably borrowed the 
marker from an Ewenic (Tungusic) language. Shirongolic also forms a subgroup for it- 
self because in all the languages the question marker fused with other elements, which 
results in a much more complicated situation. The interrogative marker -mu has, accord- 
ing to Sandman (2012: 384), been borrowed from Bonan into the Sinitic language Wutun, 
but a Turkic origin is more likely (see $8 5.9.2.1, 5.11.2, 6.1). 

Shirongolic languages also have special interrogative forms for copulas that are given 
in Table 5.83. 

Some of the forms are still analyzable into a copula and a question marker, e.g. Mong- 
ghul wei-u. In other languages such a situation may have existed before phonetic erosion 
and contraction set in, e.g. Bonan wi + -u = wu. Monggul and Mangghuer have differ- 
ent copula forms depending on the category of perspective, while in some variants of 
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Table 5.82: Overview of polar and content question markers in Mongolic lan- 
guages; intonation patterns are excluded 


Language PQ CQ 

Dagur =yee# =yee# 

Tacheng Dagur SEI: -jA# 

Moghol - - 

Khamnigan Mongol =gu# bei# 

Buryat =gü# be ~ =b# 

Shineken Buryat =go#/=gu#/=g be ~ =b# 

Mongolian =(y)UU# (Table 5.72), =(y)ii# bwii# ~ bwai#, V-b ~ V-e.b, ? 
=(y)UU# 

Khorchin (j)UU#, =(j)ii#, =mu#, ?=me# (j)ii#, be#, =me# 

Oirat =(y)UU#, =ii# =b# ~ =w# 

Kalmyk -uft, =iy# ~ =i# =b# ~ =w# 

Ordos =UU# - 

Shira Yughur -(j)juus bag, ?=(j)uu# 

Mongghul =uu, (Tables 5.78, 5.79) - 

Mangghuer =u, (Tables 5.80, 5.81) - 

Bonan V-u, (Table 5.73), V-si, ma# - 

Kangjia Va (Table 5.75), sa# -, le# 

Santa V-u, (Table 5.77), las ? 


Table 5.83: Special interrogative copulas in Shirongolic languages 


Declarative Question 
Bonan (Wu) wi SUBJ, wa OBJ, mbi suBJ, mba OBJ wu, mbu 
Kangjia i SUBJ, va OBJ, mbi suBJ, mba OBJ vu, mbu 
Bonan (Fried) wi suBJ, wa OBJ, bi suBJ, ba OBJ wu(u) SUBJ, wa-u OBJ, bu suBJ, ba-u OBJ 
Santa wo wu 
Mongghul wei NEUT, Wa, ... OBJ Wei-u NEUT, wa-nu, ... OBJ 
Mangghuer bi sugj, bang opt bi-u sont, beinu opt 
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Bonan, Kangjia, and Santa this difference is leveled in the interrogative. Special interrog- 
ative forms of copulas are also known from Japonic (Shuri, §5.6.2) and Ainuic (§5.1.2). 


5.8.3 Interrogatives in Mongolic 


There are few good descriptions of interrogatives in Mongolic. Most treatments such 
as those in Janhunen (2003e) mention only a handful of forms and leave them mostly 
unanalyzed. Most grammatical descriptions for Mongolic languages also do not mention 
the syntactic behavior of interrogatives. But they seem to generally remain in situ (e.g., 
Fried 2010: 134; Napoli 2014: 40). Nevertheless, there are quite reliable reconstructions 
for Proto-Mongolic by Janhunen (2003d: 20) that can serve as a basis for further analysis 
(Table 5.84, see also Poppe 1955: 229f.). 


Table 5.84: Proto-Mongolic reconstructions by Janhunen (2003d: 20) and their 
modern Mongolian correspondences (Janhunen 2012b: 130ff.; Benjamin Brosig 


p.c. 2018) 
Meaning Analysis Proto-Mongolic Mongolian 
who *ke/n xen 
who PL PL -d *ke-d xed 
when *ke.li WM keli 
how *ke.r xer 
how many GEN of *ke-d [?] *ke.d.ü.n xedn 
how much acc of *ke-d [?] *ke.d.ü.(y)i xedii 
when LOC "A *ke.d.i.x-e > *ke.j.i.x-e [?] xedzee 
where LOC *-A *ka.mix-a > *kaxa/n-a [?] xaan’ 
what “yan WM yan 
what *ya.xu/n yuun 
what kind of *ya.m.bar » *yamar yamer 
what thing *ya.xu.ma » "yexüme youm 
to do what *ki- ‘to do’ “ya.xa+ki- > *yaxa- [?] yaa- 
which *ali/n alyn 


Some developments assumed by Janhunen are marked with a question mark as they 
are not very plausible. There is evidence of a form *kamixa ‘where’ in some languages 
such as Written Oirat xamigha(a). In Middle Mongol, a form xamiya is attested twice 
(Benjamin Brosig p.c. 2018). Locative interrogatives are not derived from the selective 
interrogative as in Tungusic but nevertheless display parallels with the demonstratives 
(Table 5.85). The Turkic language Dolgan has a form kanna ‘where, whither’, anda related 
form xanna is found in Yakut. There are surprisingly similar forms in Mongolic, e.g. 
Khamnigan Mongol kaana or Buryat xaana ‘where’. But the Yakut and Dolgan forms 
are contractions of an interrogative that is still analyzable in other Turkic languages 
including Sarig Yughur qan-ta ‘which-Loc’ (§5.11.3). 
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Table 5.85: Spatial deictics in Mongolian according to Janhunen (2012b: 131), 


slightly reduced 
PROX (hearer) DIST INT 
LOC naa-n’ tzaa-n’ xaa-(n’) 
LOC ABL naa-n-aas tzaa-n-aas xaa-n-aas 
LAT naa-sh tzaa-sh xaa-sh 
PROL naa-g.oor tzaa-g.oor xaa-g.oor 


Table 5.86 shows five of the interrogatives that can be found in most modern Mongolic 


languages. 


Table 5.86: Five Proto-Mongolic interrogatives and their modern representa- 


tives 

*ken *yaxun *alin *kejixe “kaxana 

‘who’ ‘what’ ‘which’ ‘when’ ‘where’ 
Dagur xeng yoon aly xejee xaan 
Mongolian xen yuun alyn xedzee xaan’ 
Buryat xen yüün ali xezee xaana 
Khamnigan Mongol ken yeen ali kejie kaana 
Ordos ken yüün ali kejee kaa 
Written Oirat ken you/n ali kezee xamigha(a) 
Oirat ken yuu/n al ~ àl-k keze xamaa 
Kalmyk ken yuun aly(-k) keza xama, 

aly-d 
Shira Yughur ken yima aali kejee xana 
Santa kien yang ali giezhe khala 
Bonan kang yang ane kece(-) hala 
Kangjia ko jo ^ jag ani(ye) gadze yana 
Huzhu Mongghul ken ya(a)n ali kijee an-ji(i) 
Minhe Mangghuer kan ya, yang a(yi)ge kejie ang(ji) 
Moghol ken-kiyan emah ~ ? keja ? 
imas etc. 


According to Janhunen (2003d: 20) the stem *ke- originally had the meaning ‘who’ as 
well as ‘what’, which is an unlikely scenario from a cross-lingusitic point of view. As 
has been shown by Cysouw (2005), the only place worldwide where this pattern is not 
extremely rare or altogether absent is South America. 

Proto-Mongolic had two resonances (submorphemes), one in *k- that is still present in 
most Mongolic languages but changed to x- in Dagur, Buryat and Mongolian, and one in 
*y- that has survived up to today. Similar changes from “k~ to > *x- can be seen in Turkic 
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(e.g., Khakas, §5.11.3) or Tungusic languages (e.g., Nanai, §5.10.3). Only the interrogative 
*ali/n ‘which’ does not fit into either type. All Mongolic languages thus have what has 
been called K-interrogatives in this study. Furthermore, Mongolic also possesses the KIN- 
interrogative. Amuric (85.2.3) and especially Tungusic languages (§5.10.3) exhibit several 
interrogatives that may have been borrowed from Mongolic. 

In the following, I will address interrogatives in individual Mongolic languages in 
turn. Table 5.87 summarizes the interrogatives found in four descriptions of Dagur. The 
etymology of most of these forms has already been given above. 


Table 5.87: Interrogatives in Dagur (Martin 1961: 30f., passim; Zhong Suchun 
1982: 52; Chaolu Wu 1996: 22; Tsumagari 2003: 141f.; Yu Wonsoo et al. 2008: 63, 


passim) 
Martin Zhong Chaolu Tsumagari Yuet al. 
which ali, alin alj ali aly - alin- ali - aalj 
(attr.) 
who hen xən, anii xon,aniin  xeng, anija ~ anja 
aniing 
what joo joo joo yoo ~ jo, jooke, 
yoon- joon 
how many/much jookee jookaa yookie ~ jooked 
yiekie 
why iuuu juguu yoondaa, jugoo 
yuguu 
what (kind of) iamare ~ jamər jamər yamer jamər 
iamere 
how many hede, xadan xod xed xod, xadii, 
heden xodon 
(attr.) (attr.) 
when hejee xodzee xodzoo xejee xajaa, 
xojoor 
how here Xor xer, Xoroo 
yoondaa 
where haane xaano xaano xaan xaan 
whither xaidaa xaidaa xaan, 
xandiin 
whence xaanaar 


Chaolu Wu (1996) also mentions an interrogative verb jee- ‘to do what’. The form iuuu 
~ juguu etc. ‘how, why’ may be cognate with Buryat, Chakhar, Khalkha, and Khamni- 
gan yaa/g-aad ‘how, why’, which is a perfective converb form of an interrogative verb 
that has the form -g/-AA(r) > *-g/-AAd in Dagur (Tsumagari 2003: 145f.). The medial -g- 
may either be part of the converb or, less likely, of the verbalizer that has the form -ge 
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in Buryat or Khamnigan. The resonance *k~ changed to *x~ in Dagur, but not in all di- 
alects. The change did not take place in the Oiqihar dialect, which has forms such as kuu 
‘person’ or kar ‘how’ as opposed to xuu and xər in other dialects (Ding Danqing 1995: 
191). Dagur preserved the original interrogative xan ‘who’ but also has an innovative 
form anii that ultimately might be somehow related to “ali(n) ‘which’. Similarly, the two 
Tungusic contact languages of Dagur, Oroqen and Solon have a form a(a)wu, which orig- 
inally meant ‘which one’ but has extended its meaning and has partly replaced the form 
ni(i) ‘who’ that goes back to Proto-Tungusic (§5.10.3). If true, this could be an instance of 
a shared grammaticalization. But the exact etymology of the Dagur interrogative is not 
entirely clear. The suffix in yoon-daa is likely a dative ending followed by the reflexive 
marker, i.e -d-AA (Tsumagari 2003: 143). This is also the analysis of the form joon-d-ee 
found among the Dagur interrogative paradigms collected by Martin (1961: 30). Martin 
also lists a plain dative form joon-de, which is likely the source of Nanmu Oroqen joonde 
and could also somehow be connected with Solon yoodon. 

The interrogatives in Khamnigan Mongol and Buryat (Table 5.88) are very similar. 
Khamnigan has a more conservative phonology and preserves the initial *k~, which 
changed to x~ in Buryat. Interrogatives with the meaning ‘why’ and ‘how’ are derived 
with the help of the same case and converbial markers in both languages. Yamakoshi 
(2007a) mentions a Khamnigan form kadui cag- ‘what time’, which is probably a loan 
translation of Mandarin ji didn (§5.9.3.1). Castrén (18572) collected several paradigms of 
interrogatives and demonstratives that are given in a simplified and analyzed version in 
Table 5.89. The paradigms are clearly identical in their case forms, but the demonstratives 
take an additional stem augmentation /n. 

As in Dagur and other languages below the dative case form of the interrogative 
meaning ‘what’ has acquired the meaning ‘why’. For comparison, Table 5.90 lists Dagur 
demonstrative and interrogative paradigms, but excludes reflexive case markers. There 
are some differences in phonology and morphology such as the lack of the /n in several 
Dagur forms. However, given the overall similarity of paradigms, these will not be listed 
for all languages below. 

In Mongolian the same change from *k- to x~ as in Buryat occured (Table 5.92). Ac- 
cording to Mostaert's account of Ordos (Table 5.93), the initial k- is preserved except for 
yaa ‘where’. 

There are additional forms such as Khorchin joo ga-j ‘what say-CvB.IPFV > why’, which 
is completely parallel to Manchu ai se-me, and a Mongolian interrogative verb xaa-c- 
(< xaa-oc-) ‘to go where’. The semantic scope of yamer ‘what kind of, how’ suggests a 
connection with Turkic languages (§5.11.3). Chakhar gecneeng has a cognate in Ordos 
ce ‘ts ‘ineen and Cyrillic Khalkha xecneen (Benjamin Brosig p.c. 2016). Instead of xedii 
Khalkha usually has the complex form xer olon ‘how much’ (Benjamin Brosig p.c. 2016), 
which might be a calque of a common European formation transmitted via Russian kak 
mnogo/Kak MHoro. Georg (2003b: 202) only mentions a few forms for Ordos (ken ‘who’, 
gecineen ‘how much’, kejee ‘when’, kaa ‘where’, yüü/n ‘what’, and yamar ‘what kind of’). 
But in his list kaa still preserves the initial k-. 
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Table 5.88: Interrogatives in Buryat (Yamakoshi 2011a: 170; Skribnik 2003: 
111) and Khamnigan Mongol (Janhunen 2003b: 92; Yamakoshi 2007a: passim); 
Buryat also has xedii-dexi ‘how manieth' and xedii-lüülen ‘in a group of how 
many’; some variants were excluded 


Shineken Standard Khamnigan Khamnigan 
Buryat Buryat (Janhunen)  (Yamakoshi) 
who xe/n xe/n ken kon 
how many xedii xedii.n- kodui 
when xezee xezee kejie kozie 
how xer ker 
where xaa-(na) xaa-(na) kaa-na kaa-na 
whither xaa-s! xai-sha kaa-si kaa-s 
whence xaana-haa 
through where xaa-g-oor xaa/g-uur 
what kind of jamar yamar yamar jamar 
what jun, juu yüü/n yee/n- yuu/n jun, joon 
why (what-DpAT) juun-de yüün-de yeen-du joon-do 
to do what(what-v-) jaa- yaa-(ge-) yaa-(g-) joo ki- 
how, why (IpFv.cvB)  jaa-zia yaa-zha 
how, why (PFv.cvB) yaa-g-aad yaa-g-aad 
which ali ali ali 


Table 5.89: Simplified paradigms of interrogatives and demonstratives in 
Buryat according to Castrén (1857a: 31ff.); only singular forms and not all vari- 


ants are shown 


this that who what which 
plain e.né te.ré ke/n ju/n ali/n 
ACC ené/n-i teré/n-i ken-i jun-i alin-i 
DAT ené/n-de teré/n-de ken-de jun-de alin-da 
ABL ené/n-ehe teré/n-ehe ken-ehe jun-ehe alin-aha 
INST ené/n-er teré/n-er ken-er jun-er alin-ar 
COM ené/n-tei teré/n-tei ken-tei jun-tei alin-tai 
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Table 5.90: Paradigms of interrogatives and demonstratives in Dagur according 
to Martin (1961: 28ff.) in analogy to Table 5.89. 


this that who which 
plain e.ne te.re he/n ali 
ACC e.n-ii te.r-ii hen-ii joon-ii ali-i 
DAT e.n-de te(.re)/n-de hen-de joon-de alin-de 
ABL e.n-eese te.r-eese hen-eese joon-oose ali-eese 
INST e.n-eere te.r-eere hen-eere joon-oore ali-eere 
COM e.n-tei te.re-tei hen-tei joon-tei ali-tei 
Table 5.91: Interrogatives in Mongolian (Janhunen 2012b: 130ff., 255f.) and in 
Chakhar (Sechenbaatar 2003), Darkhat (Gáspár 2006: 46), and Khorchin di- 
alects (Yamakoshi 2015: passim); not all forms and variants are listed 
Mongolian Chakhar Khorchin 
who xe/n xeng - xen- xon 
how many xed//n xed, xede-ng (attr.) xodon 
how much xedii xedii xodii, xodoo 
when xedzee xejee, xediis xajaa 
how xer 
where xaa(-n’) xaa(-na=n) xaa 
what yuu/n yuu, yuu/n joo 
why (DAT) yuun-d 
how, by what yuu-geer 
means (INST) 
what kind of, yamer yamar jamar 
how 
to do what yaa-, yuu xii- jaa(-x)-, joo xii- 
how (CVB.IPFV) yaa-j jaa-j 
why (CVB.PFV) yaa-gaad 
how much/ gecneeng 
many 
which aly//n aly ~ aly/n ~ el’ 
alya/n 
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Similar to Ordos, in both Oirat and Kalmyk the *k- remained stable in the stem *ke- 
but changed to x in the stem *kaa- (Table 5.92). The same is true for Shira Yughur and 
maybe for Santa as well (see below). The form aly-d ‘where’ in Kalmyk clearly is a loca- 
tive (dative) form of the interrogative aly ‘which’. Spoken Oirat äl-k and Kalmyk aly-k 
in all likelihood have the same origin as Mangghuer ali-ge. Instead of Kalmyk xamaran 
‘whither’ my informant employed the form al’daran, based on aly ‘which’. 


Table 5.92: Interrogatives in Oirat and Kalmyk (Birtalan 2003: 220; Blasing 


2003: 239) 
Written Oirat Spoken Oirat Kalmyk 

who ken ken ken 
when kezee keze keza 
how much/many kedüi ~ kedüü kedn- kedü ` kedü ~ kedü/n- 
what you/n yuu/n yuun 
to do what yagh- 
why (-ad cvs.prv) yagh-ad 
what kind of yamaaru yamr/n yamr - yamaran 
where (-d roc) xamigha ~ xamighaa xamaa xama, aly-d 
whither xamaran 
which ali al ~ al-k aly ~ aly-k 


In Shira Yughur, however, roughly half of the interrogatives show a resonance in y-. 
Most interrogatives are either inherited from Proto-Mongolic or have a straightforward 
explanation such as a contraction with a following verb or the presence of a case marker. 
Only the form yima ‘what’ clearly differs from Mongolian. Its explanation is probably 
related to the change of meaning of the interrogative yaan from ‘what’ to ‘how’. 

The Bonan interrogative yala ‘where’ similar to Santa khala has a liquid l instead of 
a nasal n (cf. Mongolian xaan’). Whether ana ‘which’ is cognate with Mongolian alyn 
‘which’ or rather Dagur anii ‘who’ remains unclear to me. The forms jant"oy and yamtig 
from the two different descriptions probably represent dialectal variants of one and the 
same interrogative. The interrogative yamten'ge ‘how much’ is said to result from a fu- 
sion with the numeral nege ‘one’. This development of the numeral ‘one’ appears to have 
been influenced by Tibetic (883.5, 5.9.3.2). Both ‘how’ and ‘why’ are clearly based on the 
interrogative verb jan-ga-, which in turn is transparently derived from jan- ‘what’. 

Kangjia has the same derivation jan-gi- ‘to do what’ but the stem jan ‘what’, possibly 
in analogy to ko ‘who’, also has the alternative form jo ‘what’. As opposed to Bonan, 
but similar to Santa, there is an interrogative stem ma-. As opposed to Bonan yamten’ge 
‘how much’, Kangjia has ma-tu niye that is derived from matu but is likewise based on 
the numeral ‘one’. 

The origin of the Santa interrogative dzidzian-da is Chinese ji-dián ‘what time’. But 
it contains an autochthonous dative (locative) marker -da that is also present in the 
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Table 5.93: Interrogatives in Shira Yughur (Nugteren 2003: 273; Zhaonasitu 
1981a: 27, passim), and Ordos (Mostaert 1937: passim) 


Shira Yughur Ordos 

Nugteren Zhaonasitu 
who ken ken ken 
what juu, juun 
how much (niye keedi keedo niye ke‘p"ii, ce‘tS‘ineen 
one) 
how many keden keedo ke dm 
when kejee kedzee ke päee 
where xana yana yaa(-na) 
whither xagshi xacfo 
whence xanasa 
to go where xanad- 
("kaana od- >) 
how (INST) yaan juu-gaar 
why (DAT, CVB.PFV)  yaan-di jaan-do juun-py, juundaan, jaay*yuun 
what yima ima 
what kind of, how yimar imar jamar 
to do how yaa-gi- ima-yo- 
to do what yima-gi- jaa-(x*xi-) 
to happen how yimal- 
(*yama bol-) 
which aali aalo ali 


complex expression ali orn-da, which literally means ‘at which place’ and has parallels 
in several languages such as Mandarin Chinese zai shénme difang ‘(cop.)Loc what place’ 
but also Kangjia ani satfa. Another loan from Mandarin is the second part of yan shihou 
‘what time’ (Mandarin shihou ‘time’) that is also present in Kangjia ani-ye sury-du. 

The form matu ‘how’ looks very untypical for Mongolic. According to Siqinchaoketu 
(1999: 194), Kangjia ma- is an abbreviated form of jama, which seems possible but is 
in need of further explanation. A more plausible alternative would be a Sinitic origin, 
e.g. Mandarin mà (§5.9.3.1). The second part of ma-tu could be a derivational suffix that 
attaches to nouns to form adjectives (Kim 2003: 352). It may be noted that in Kangjia 
the suffix -tu is optional in the interrogative verb ma(-tu)-gi- ‘to do how’. Interestingly, 
in Santa matu is usually followed by the verb gie- ‘to say, to make, to think’ unless it 
has the form ma-tu-kan. In fact, Santa ma-tu gie- looks suspiciously similar to Kangjia 
ma-tu-gi-. The suffix Fon possibly derives nouns from adjectives (-ghang in Kim 2003: 
352), which would explain why it is followed by a copula in the following example. 
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Table 5.94: Interrogatives in Bonan (Fried 2010: 144, 261; Hugjiltu 2003: 337, 
342) and Kangjia (Siqinchaoketu 1999: 185ff., passim; 2002: 73); see also Todaeva 


(1963: 187) 

Bonan Kangjia 
Fried Hugjiltu 

who Kan kang ko 

whose (-gar Poss) k^ag-gar) kang-g(h)ang 

how many khatho kudung - gudo ~ gado 

kutung 

which (niye ‘one’) ano ane ani, aniye 

when (-du DAT) aniye suryou-du 

where (satfa ‘place’) ani satfa 

what thing jama(-su(n)) 

where xala hala yana 

whence yana-sa(la) 

when Kate ~ kece - kece- gədze 
k'otco-yar xangnang 

what (=ga sG.IND) jan ~ jan-go yang jo ~ jay 

to do what (-ga/-ge/-gi v) jan-ge- yang-ge- jay-gi- 

how (-t¢a/-je CVB.IPFV) jan-ge-tee yang-ge-je 

why (-say P.PFV, ?-da DAT) jan-ge-say yang-ge-da 

what kind of jant^ox yamtig 

how much (nege/niye one) yamten'ge ma-tu niye 

how (-tu ADJ) ma-tu 


to do how (-gi v) 


ma-tu-gi- ~ ma-gi- 


(247) Santa 


a. tci ma.tu gie-wo? 
2sG how do-TERM 


'How are you doing?' 


b. an.udu.ku tcientci ma.tu.kar wo? 
weather how 


today 


COP 


'How is the weather today?' (Chaolu Wu 1994c: 17) 


In Todaeva (1959: 288) we find an additonal form ma.tu.n-ni ‘what kind of" with a third 
person possessive ending. The complex interrogative yan gie-zhi ‘why, what for’ can be 
analyzed as ‘what do/say-cvB.IPFV’ and is a parallel to Bonan yang-ge-je. Not mentioned 
in Table 5.95 are plural forms such as jan-la (Ma Guoliang & Liu Zhaoxiong 1986: 174), 
which carries the special Santa plural marker of unclear origin. The initial *k has three 
or four different reflexes (k, g, and q ~ kh). It may be noted that today in both Santa and 
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Table 5.95: Interrogatives in Santa (Chaolu Wu 1994c: passim; Kim 2003: 356; 
Napoli 2014: 40); see also Field (1997: 360); not all variants are listed 


Chaolu Wu Kim Napoli 
who kion kien kien 
how much/many giodun, giadu-pan giedun giedun 
when giezhe giezhe 
where qala khala khala 
whence khala-se 
what jay, ja yang yan 
why (gie-zhi ‘do/say-CVB.IPFV ) yan gie-zhi 
which ali ali 
where (-da DAT) ali orn-da 
how (gie- ‘to do/say’) matu gie-, matu-kay matu gie- 
when, what time dzidzion-do yan shihou 


Kangjia the interrogative ^who' is the only interrogative starting with a k-, for which 
there may well be functional rather than phonological reasons ($6). 

The majority of interrogatives in Mongghul start with a-, several with k- and only 
one or two with y- (Table 5.96). Faehndrich (2007: 127) mentions one form ticaan ‘how 
many' that seems to have been borrowed from an unknown source. For the Halchighul 
dialect of Mongghul, Schróder (1964: 151) lists the interrogatives kan ‘who’, yan ‘what’, 
ali ‘which’, and kidi ‘how much’. 


Table 5.96: Interrogatives in Huzhu Mongghul (Chaolu Wu 1994b: passim; 


Faehndrich 2007: 127; Dpal-ldan-bkra-shis et al. 1996: passim) 


Chaolu Wu Faehndrich Dpal-ldan-bkra-shis 
who ken kani SUBJ, kana opt ken 
how much/many kidi, ticaan kidi-hangi 
when (-di DAT) kedzee kcee, ali-sxuu-di kijee, ali sghuu 
which ale ali 


which one 
what kind of 
how, why 
where 

what 


amaca, amarr- 
andzii 
jaan 


amagidza 
andzii SUBJ, andza OBJ 
jaanii SUBJ, jaana OBJ 


alinga, alingi 
amahgi sanba 
amaga, amakiji 
anji(i) 

yan, yanna 
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The second part of amahgi sanba ‘what kind of’ mentioned by Dpal-ldan-bkra-shis 
et al. (1996: 232, 241) means ‘kind, type, pattern’. The form ali sghuu/ali-sxuu-di liter- 
ally means ‘at which time’. A speciality of Mongghul is the presence of a perspective 
distinction in several interrogatives, which was only mentioned by Faehndrich (2007). 


Table 5.97: Interrogatives in Minhe Mangghuer (Slater 2003a: 55, 86; Dpal-ldan- 
bkra-shis et al. 1996: passim) 


Slater Dpal-ldan-bkra-shis 
who kan kan 
how much/many kedu 
when kejie 
why, how (=la INST, =ji DIR) ya-la, ya=ji, ya ya-la, ya-ge, ya-ji 
what yang, ya yang 
which (one) ayi-ge ali, a-ge, ali-ge 
what kind of amer-da yamer(-da) 
where (from yang?) ang ang-ji 


Neither Slater (2003a,b), nor Dpal-ldan-bkra-shis et al. (1996) give a clear analysis of 
the Mangghuer interrogatives (Table 5.97). But kan, kedu, kejie, yang, ali, and yamer are 
clearly of Proto-Mongolic origin. The form angji ‘where’, also present in Mongghul as 
anjii, probably contains a case ending that was given as a directive =ji by Slater (2003b: 
312) and is specific to Mangghuer. Problematically, it expresses only direction but not 
location, for which there is the dative/locative =du. The form ya-ji ‘why’ thus literally 
means ‘where to’ (cf. English to what end). The comparison of the two forms amerda and 
yamerda (both with an unclear suffix -da) with and without initial approximant suggest 
that the form ang might be a variant of the interroagtive yang ‘what’. The interrogative 
ayige or age probably contains the indefinite singular marker -ge, which is either derived 
from the Chinese classifier ge “ (via the loan yige ‘one’, from yí-gé 'one-crr"), or from the 
autochthonous numeral nige ‘one’ (cf. Slater 20032: 100). This analysis is corroborated by 
the form ali-ge ‘which one’. But the first part ayi- or a- remains unclear from a language- 
internal perspective. Most likely it has been borrowed from a Sinitic language (§5.9.3.1). 
The corresponding form in Mandarin is ná(-yi)-ge with or without the numeral ‘one’. 
This may explain the difference between ayi-ge and a-ge. In Sinitic languages of the area 
the interrogative lost the initial nasal, e.g. Hezhou/Linxia a-3i74-ga ~ a-ji?*-go. 
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5.9 Trans-Himalayan 


5.9.1 Classification of Trans-Himalayan 


This study includes languages from three of the 42 subbranches of Trans-Himalayan (van 
Driem 2014: 10). These branches are Sinitic, Bodish, and Qiangic. Of Bodish, only the Ti- 
betic subbranch will be included here. Glottolog (Hammarstróm et al. 2016) mentions 475 
Trans-Himalayan languages, which are almost all located to the south of NEA. The ex- 
act relation of the individual subbranches is somewhat disputed and is of no particular 
concern here. Despite certain controversies (Kurpaska 2010: 25-62), Sinitic is usually di- 
vided into seven often mutually incomprehensible main dialect areas called Gan (gan $&), 
Hakka (kéjia £X), Mandarin (guàn =), Min (min È), Wu (wú 3), Xiang (xiang 8), and 
Yue (yué $). The existence of separate Pinghua (pinghud YEW), Jin (jin &), and Hui (hui 
Sr dialects is somewhat disputed. Of these, Mandarin is the largest and, if one includes 
Jin and Hui within it, the only one located in NEA. Recent migrations of speakers of other 
dialects will not be considered. Mandarin itself may be classified into several regional 
varieties, of which the Southeastern area in and around Sichuan, Chongqing, Guizhou, 
and Yunnan as well as the Jianghuai subdialect area around the lower Yangtze are mostly 
excluded from this study. An exhaustive overview of questions in all the remaining Man- 
darin subdialects is impossible to give because of a lack of high quality materials. The 
focus will lie on a description of Standard Mandarin, to which will be added a sample 
of regional dialects for which good data was available, especially for interrogatives. Spe- 
cial cases to be addressed are Dungan (donggan KT, dunganskij/nynrancxuii), Gangou 
(gangou H 14), Hezhou (hézhou 18] JM) or Linxia (línxià mZ), Tangwang (tángwang F& 
T£), and Wutun (wütün 11.15). With the exception of Dungan, spoken in several Central 
Asian countries, but derived from Northwest China, these languages are all spoken in 
the Qinghai-Gansu area and exhibit a strong influence from Turkic, Mongolic, or Tibetic. 

Tibetic alone encompasses about 200 different varieties, divided into eight different 
groups or sections (Tournadre 2005; 2014). Here only Amdo Tibetan (anduo % £) di- 
alects and gSerpa (sé’érba &/f 3l) from the northeastern, as well as Baima (báimá A 
44), Cone (Chone, zhuoní JE), and Zhongu from the eastern section will be included. 
I currently lack sufficient data for some varieties such as Khalong Tibetan from the east- 
ern section spoken in northern Sichuan (see Sun 2007).? Both Zhongu and Baima are 
sometimes considered Qiangic instead of Tibetic. Amdo Tibetan is one of the more dom- 
inant languages of the Amdo Sprachbund (Sandman & Simon 2016) and is said to have at 
least 23 different subdialects (Ebihara 2011: 43). Given the limited amount of information 
available, only a fraction of this variation can be included here. Of the disputed Qiangic 
branch of Trans-Himalayan, only the extinct Tangut language, the major language of the 
Xixia empire (1038-1227), was located in NEA. The language was only rediscovered and 
deciphered in the 20 century (Gong Hwang-Cherng 2003). 

Descriptions of Chinese dialects frequently suffer from a lack of accuracy and analysis, 
the use of characters for transcription, and an incoherent use of characters for etymolog- 


??There seems to be no official Chinese name for Zhongu or Khalong yet. 
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ical and phonological purposes. This unfortunate and confusing situation could only be 
partly remedied here. Chinese dialectal or historical data that was available in Chinese 
characters exclusively will be transcribed with the official romanization system Pinyin 
without tones in square brackets. 


5.9.2 Question marking in Trans-Himalayan 
5.9.2.1 Question marking in Sinitic 


Old Chinese had a sentence-final polar question marker 2. [hu] (see 248) that has been 
reconstructed as *c'a by Baxter & Sagart (2014b). 


(248) Old Chinese 
Bee IRF 
[xian zheyi le ci hu] 
virtuous N also enjoy this Q 


‘Does a man of virtue also enjoy such (things)?’ (Pulleyblank 1995: 139) 


Its form is somewhat reminiscent of interrogatives that will be described in the next 
section. There are several other question markers that appear to be contractions of [hu] 
F with other elements and will not be discussed any further here (see Pulleyblank 1995: 
139ff.). Content questions were generally unmarked. 


(249) Old Chinese 
fep Al) 
[he bi yue li] 
why must say profit 


"Why must you say "profit"? (Pulleyblank 1995: 145) 


Of course, there is a lot of variation to be found in historical stages of Chinese, but a de- 
tailed examination of diachronic developments that necessarily also includes all modern 
Sinitic languages goes well beyond the possibilities of this study. Instead, the following 
will address exclusively the modern Sinitic languages located in NEA. 

Standard Mandarin Chinese data are partly based on my own knowledge and were 
partly elicited or confirmed in 2015 and 2016 with the help of a native speaker from 
Guiyang, fluent in both the dialect and standard Mandarin, living in Germany. Man- 
darin Chinese has many different question marking strategies, but the default form is 
the sentence-final marker ma D that marks polar questions. 


(250) Mandarin 
chi-fan le ma? 
eat-food PFV Q 


‘Have you eaten yet?’ 
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Polar questions may also simply be marked with rising intonation, though this seems 
less common. Content questions are usually unmarked morphosyntactically and have 
an intonation contour similar to a declarative sentence, but with additional emphasis of 
the interrogative. 


(251) Mandarin 
ni jiào shénme? 
2sc call what 


"What is your name?’ 


The marker ba DÉI has a function similar to a polar question and for practical purposes 
is classified as such. But it has an additional element of supposition by the speaker. In 
some instances it is best translated as a tag question in English. 


(252) Mandarin 
ni chi-fan le ba? 
2sc eat-food PFV Q 


"You must have already eaten (I suppose)?’ 


In another function, ba IE is also an imperative marker. In its function as question 


marker, which is similar to English must in relatively certain hypotheses, it has been 
adopted by a great many languages in China, perhaps because of its very specific seman- 
tic nuances. Both ma and ba can be the actual question marker in complex tag questions. 


(253) Mandarin 
a. nf chi-fan le dui ba? 
2sG eat-food prv right Q 
"You have already eaten, right?' 
b. ni bang wó mái.dongxi, háo ba? 
2sc help 1sc shop good Q 
"You are going shopping for me, right?' 


In both cases ba may be substituted with the more neutral ma, which is accompanied 
with a slight change in meaning. Several more patterns are possible, e.g. shi ma? 7293 
‘cop Q’, kéyi ma? BJ LAIS "be possible Q’, xing ma? 47" "be possible g’. 

Mandarin has a further colloquial marker ne WẸ that has both an interrogative and 
non-interrogative function (Li & Thompson 1981: 300-307). In its interrogative function 
it has an interesting distribution. It can be found in negative alternative questions (A- 
not-A), content questions, and “truncated” or elliptical “questions consisting of only one 


noun" (Li & Thompson 1981: 305) such as ta ne? '(And) how about him/her?’. 


(254) Mandarin 


a. ni qubu qüzhongguó ne? 
2SG gO NEG gO PN Q 


‘Are you (actually) going to China?’ 
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b. nà ni qünáli ne? 
that 2sc go where Q 


"Well, where are you going then?' 


The last function (254b) is what might be called a topic question, the exact meaning 
of which depends on the previous discourse. Yet another connection of ne WE with ques- 
tions is a special type of alternative question construction that we will encounter further 
below. 


As for focus questions, there are several possible patterns. One is a cleft-like structure 
with the copula shi z& and occurs before the focused element. Compare the following 
examples of a polar and two focus questions. If the copula stands sentence-finally, it 
functions as a question tag. 


(255) Mandarin 

a. ni qu zhongguó, shi ma? 

2SG gO PN COP Q 

"You are going to China, are you?’ 
b. ni shi qu zhongguó ma? 

2SG COP go PN Q 

'Is it China that you are going to?' 
c. shi ni qu zhongguó ma? 

COP 2SG go PN Q 


‘Is it you who is going to China?’ 


The same sentences are possible with the marker ba IE. Another marker for focus ques- 
tions is likewise based on the copula but has itself a structure of an A-not-A question, 
shi-bu-shi KZ ‘cop-NEG-copP’, which is why no additional question marker is present. 
It is treated as a single marker here that stands before the focused element or attaches 
to the sentence and functions as a question tag. 


(256) Mandarin 
a. ni qu zhonggud, shibushi? 
2SG go PN Q 
"You are going to China, aren't you?’ 
b. ni shibushi qu zhongguó? 
2sG Q go PN 
'Is it China that you are going to?' 
c. shibushi ni qu zhongguó? 
Q 2SG go PN 


'Is it you who is going to China?' 
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In the latter two examples (256b, 256c), shi-bu-shi may be replaced with the more literary 
shi-fou #27, a combination of the copula with an otherwise uncommon negator. This is 
part of the literary language and cannot function as a question tag. But there are several 
related question tags such as dui-bu-dui X] 45X} ‘correct-NEG-correct’, háo-bu-háo DZ 
if 'good-NEG-good' xíng-bu-xíng 114511 'be.possible-NEG-be.possible', ké(yi)-bu-kéyi WT 
VAS RT BÀ "be.possible-NEG-be.possible', or hui-bu-hui BAS 'be.able-NEG-be.able', all of 
which are of the A-not-A type but do not usually mark focus questions. This is evidence 
that shi-bu-shi actually has the status of a sentence (A-not-A question) in tag questions, 
but of a question marker in polar and focus questions. 

Alternative questions have two main construction types, either mere juxtaposition or 
the use of an interrogative disjunctive háishi We (different from the standard disjunc- 
tive huózhé Ek) (cf. Hólzl 2016b: 20). 


(257) Mandarin 


a. zhen.de O  jiá.de? 
true (or) false 
‘Is that true or false?’ 

b. nï qu zhonggué háishi ribén? 
2SG go PN oro Japan 
‘Are you going to China or Japan?’ 


One very dominant question category in Mandarin Chinese are negative alternative 
questions that exhibit the same two marking strategies as plain alternative questions. 
Juxtaposition is much more frequent and productive in negative alternative questions 
(A-not-A questions) than in plain alternative questions (Hólzl 2016b: 20). 


(258) Mandarin 
nf qüzhongguó (háishi) bú qu zhongguó? 
2SG go PN or.Q NEG go PN 


‘Are you going to China or are you not going to China?’ 


In some cases the whole second alternative is deleted except for negation. This may be 
the basis for the grammaticalization of interrogative particles such as ma DÉI. Note that 
the following two sentences are completely identical in structure. The major difference 
appears to be the fact that ma is restricted to this construction while bà ^^ is still a 
productive negative marker otherwise. But note that in this context sometimes it had 
already lost its tone, which is a sign of grammaticalization (i.e., phonetic erosion, Hélzl 


2015e). 


(259) a. Mandarin 
ní qüzhongguó bu? 
2SG gO PN NEG 
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b. ni qu zhonggud ma? 
2SG gO PN Q 


‘Are you going to China?’ 


In the past tense or in sentences that contain the existential you A 
to replace the second alternative with the negative existential. In the former, case an 
additional marker in the first alternative is necessary. Following Sun Chaofen (2006: 64- 
70), these may glossed as experiential (ExP) guo i and perfective (Prv) le T , respectively. 


D it is also possible 


(260) Mandarin 
ni qu-gud/le zhongguó méi.yóu? 
"Zeg go-EXP/PFV PN NEG 


‘Have you been to China or not?’ 


My informant tells me that, in the first case, the number of times one has been to 
China as well as the exact time is irrelevant, while in the latter the event is thought to 
have happened once and relatively recently. Note that the second alternative may not 
consist of the disjunction and a negator, exclusively (*haishi bu?) (Luo Tianhua 2013: 186), 
which represents a difference with respect to MSEA (Clark 1985). 

In extreme cases the whole second alternative is deleted and alternativity is indicated 
with the help of the disjunctive connective háishi 18 ‘or.Q’, exclusively. These devel- 
opments crucially depend on the context of an elliptical (more precisely analiptic) alter- 
native question. Mandarin also allows open alternative questions. 


(261) Mandarin 
ni qu zhongguo háishi nă.r? 
2SG go PN or.Q where 


f 


In this case, my informant tells me, one of the interrogatives nă W/ná.r YS) L/nă.li WE E 
‘where’ would seem more natural than shénme fT "A ‘what’, which, however, is possible 
in other examples. 

An element that can often be encountered in Mandarin questions is the sentence-final 
marker a IN ~ ya IF that I analyze as enclitic =(y)a. It is not a question marker as such 
but “has the semantic effect of softening the query" (Li & Thompson 1981: 313). Some 
examples will be given further below. Following these authors, the enclitic will be glossed 
as reduced forcefulness (RF). 

As we have just seen, question marking in Mandarin is relatively complex and exhibits 
many different constructional patterns. The same is probably true for the other Sinitic 
languages surveyed in the following. But in the absence of native speakers and a lack 
of detailed information, only some limited information can be given here for each lan- 
guage. Chinese as spoken by the Hui (Chinese speaking Muslims) in Urumdi is relatively 
close to Standard Mandarin. Polar questions have a cognate of ma !4 and content ques- 
tions optionally have a cognate of ne UE. As expected there are also negative alternative 
questions. 
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(262) Mandarin (Urumqi Hui) 


a. tgo" ti?! ma”? 
real Q 
‘Really?’ 


b. ni”? sã ner” tey? 
2SG DIR where go 
‘Where are you going?’ 
c. sei” tcivu 4 vy nal? 
who save 1sG Q 
‘Who will save me (then)? 
d. ni? nən” pu? nan** le? 
2sG be.able NEG be.able come 
‘Are you able to come?’ (Liu Liji 1989: 222, 219, 206, 217) 


Similar to Standard Mandarin, question tags may contain a question marker, e.g. x»? pa?! 
If T, or may have the form of an A-not-A question. 


(263) Mandarin (Urumqi Hui) 
ts T list! fe ^ tset tc y. x9? pu” x5??? 
eat Prv meal again go good NEG good 


‘Let’s go after having eaten, alright?’ (Liu Liji 1989: 221) 


In the following example of an alternative question (264), the first alternative receives 
the marker ne lE that combines with the disjunction and necessarily is preceded by the 
copula that precedes the first focused element. 


(264) Mandarin (Urumqi Hui) 
ni” st’ tg yu” jæ” nitt xe? si? xy! tsa”? 
2sG cop smoke Q or drink tea 


‘Do you (want to) smoke or drink tea?’ (Liu Liji 1989: 221) 


This has an exact parallel in Standard Mandarin (ni shi chouyan ne háishi he chá?). 

An idiosyncratic pattern is the presence of a question marker on the first alternative in 
alternative questions that may also lack a cognate of Mandarin háishi iŒ. This pattern 
was probably influenced by surrounding Turkic languages. Especially intriguing is the 
optional combination of two markers, which is impossible in Standard Mandarin but can 
also be found in Hezhou Chinese (see Table 5.98 below). 


(265) Mandarin (Urumqi Hui) 
ni” ta ni?! ma? vy? tat? 
2sc big Qo. Q jee big 
"Are you older or I?' (Liu Liji 1989: 211) 


261 


5 Survey of the grammars of questions in Northeast Asia 


Xining Mandarin has unmarked content questions, though cognates of Mandarin 
=(y)a W[/Uf and ne YE are optionally present. Polar questions have a marker mo? which 
does not appear to be a cognate of Mandarin ma D, which is attested as ma (Zhang 
Chengzai 1980: 300). The difference with Standard Mandarin is mostly phonological in 
nature. The formal similarity to the Tangut sentence-final question marker mo? is prob- 
ably accidental. 


(266) Xining Mandarin 

a. fei” ja? 
who RF 
‘Who (is it)?’ 

b. D9 5921324 19532 
HON four Q 
‘What about Laosi (the fourth brother)?’ 

c. teia” xa” ma for cua m3”? 
3sG yet NEG speak finish Q 
‘Has (s)he still not finished speaking? (Zhang Chengzai 1980: 300) 


There is also the development from negation to question markers. In the following 
example Standard Mandarin employs the affirmative potential marker (ná-de-dóng). 


(267) Xining Mandarin 
a. na^ tuo. 73 — lig? (py*4)? 
take move ?POT NEG 
‘Can you take it?’ 
b. ni? fă” ts‘) lio! mo?*? 
2sG meal eat PFV NEG 


‘Have you eaten?’ (Zhang Chengzai 1980: 301) 


The negators are cognates of Mandarin bù Ñ (non-past) and méi iX (past). A native 
speaker living in Germany in January 2017 made me aware of the fact that the use of 
mal in example (267a) is not only possible, but perhaps more natural. In April 2017 the 
following examples of alternative and focus questions were recorded. The transcription 
and analysis roughly follow Zhang Chengzai (1980). Given that the speaker appears to 


show strong influence from Standard Mandarin, tones were omitted for simplicity. 
(268) Xining Mandarin 


a. ni tsunku ts^-lia ma xais] Z1pan ts^-lia? 
2SG PN go-?POTQ orQ PN go-?POT 


‘Are you going to China or to Japan?’ 
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b. tsunku tc'y tsuotsis| ni sa? 
PN go ?N COP 2sG Q 


‘Is it you who is going to China?’ 


Example (268a) apprently contains cognates of Mandarin ma IH and háishi i 4. The 
question marker sa in example (268b) appears to also exist in Hezhou Chinese (see be- 
low). 

Dungan questions appear to be very close to Mandarin as well. In the first two ex- 
amples the original was in traditional characters that have been changed into simplified 
characters. In the last example the Chinese characters have been added by me. Dungan 
is usually written with Cyrillic letters, however. 


(269) Dungan 
a. (TIEN A L? 


[ni-men ouzhou donggan ren duo ma?] 
2SG-PL PN PN person many Q 


‘Are there many Dunggan in Europe?’ 
b. MERJE? 

[za-de ne?] 

how-ATTR Q 

‘How are you?’ (Rimsky-Korsakoff 1994: 486, 515) 
c. [SS BOLE BLAIR?) 

ti san tuarxua liti fe-ti sa? 

ORD three cLF speech inside say-Apv what 


"What is said in the third sentence?' (Rimsky-Korsakoff 1967: 382)? 


Question marking in Hezhou/Linxia Chinese is very complex and deviates strongly 
from Standard Mandarin. Table 5.98 summarizes the specialized description of question 
markers by Xie Xiaoan & Zhang Shumin (1990). Especially interesting is a functional 
differentiation of three different question markers for polar and content questions each. 
The marker mi? most likely was borrowed from Uyghur =mu (85.11). The markers la? and 
za’ apparently found their way into some Mongolic languages (85.8). The combination 
of two question markers such as nimu’? is similar to Urumqi Hui Chinese. A double 
marking pattern for alternative questions most likely has been adopted from Turkic as 
well. 

The following two examples of a polar question with the Uyghur question marker 
(270a) as well as an unmarked content question (270b) were given without characters 
and tones. 


23 My Mandarin informant made me aware of the fact that 3# [fei] is usually employed as a character for ‘to 
say' in Gansu province. 
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Table 5.98: Hezhou/Linxia Chinese question markers (Xie Xiaoan & Zhang 
Shumin 1990: passim); notation of vowels slightly adjusted; elements given in 
characters only are rendered here in Pinyin without tones in square brackets 


as an approximation 


Type Form Usage 
PQ ma with negation 
la? with assertion 
nP mu’? polite, solemn, younger towards older 
speakers 
CQ za? PERSON, THING, QUANTITY 
n PLACE, TIME, MANNER, male and young 
speakers 
ni za? PLACE, TIME, MANNER, female and old 
speakers, more polite than n. 
AQ X nP mi), Y n? anticipated actions 
X li? mv’, Y li»? past actions 
X n? mu’, Y ni, [haishi] Z Mandarin háishi oo 
X nb, Y, [haishi] Z 
X mi), Y mè, [haishi] Z 
NAQ Xnimi,wrc (X) 
X là), NEG (X) 
?TO nn sa? 
(t)? nb sae 
[jiushi] la? Mandarin jiüshi ‘exactly’ 
[duizhe]la? Mandarin dui ‘correct’ 
[bushi] pe? Mandarin büshi ‘isn’t’ 
[jiushi] 1a? [shi] relative certainty 
X la [bu](X)[shi] Mandarin bu ‘NEG’ 
ga? 
(270) Hezhou/Linxia 
a. fatsi-xa ` ke-vé-li-zmu? 
house-Acc build-finish-PFv-Q 
‘Is the house built?’ 
b. ni tham-xa frma-la — khué-te-li? 


2sc 2PL-ACC what-INsT wait-?-?INT 
"What would you serve them with?' (Lee-Smith 1996b: 866, 868) 


Dwyer (1995: 158) claims that the Xunhua subdialect of the Hezhou/Linxia Chinese 
has a tag question marker that derives from an interrogative meaning ‘what’. The only 
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example given is a content question, however, and more likely, it is cognate with za? 
seen in Table 5.98. Content questions may also be unmarked. 


(271) Hezhou/Linxia (Xunhua) 


44 > 44 


phai-sá aagi" gə” “pe dzo?-goasltf't d xeur sa”? 
1sG-PL send-nEs which accomplish this-crr matter COP good Q 
‘Whom should we send to take care of this?’ 

b. pi? so ^ ma^! ko mai? lis? 
2sc what buy Prv 


"What did you buy?' (Dwyer 1995: 158, 162) 


a. not-mé 


The negator in the following negative alternative question (A-not-A) (272) is what has 
been called a potential (Por) meaning, cf. Mandarin zháo-bu-ddo ‘seek-NEG.POT-RES’ ‘not 
be able to find’. But the affirmative counterpart would usually require another marker to 
substitute for the negator, cf. Mandarin zhdo-de-dao 'seek-NEG.POT-RES' ‘be able to find’ 
(Sun Chaofen 2006: 60f.). 


(272) Hezhou/Linxia (Xunhua) 
d35^*-gv "dzüon*-dzr xê”, ni? nal zl! -la na'*-bu-xa 
this-cLF heavy-EXT very 2sG carry-RES-Q carry-NEG.POT-RES 


4 419 


‘This is very heavy, can you carry it or not?’ (Dwyer 1995: 173) 


Following Xie Xiaoan & Zhang Shumin (1990), the marker -la has been reanalyzed as a 
question marker here. Dwyer (1995) does not give an example of a polar question. 
Wutun has a polar question marker -a that has been compared to both Mandarin ma 
IZ as well as the interrogative a- ‘which’ (Mandarin nd WE) Janhunen et al. 2008: 99). 
Another possible source might be Mandarin =(y)a I/F. The marker has been called an 
enclitic particle, but was written attached to the preceding word with a hyphen. It is 
analyzed as enclitic here and thus written as =a. Similar to some Mongolic languages 


in the area, the marker fuses with certain preceding suffixes, which speaks in favor of 
an analysis as a suffix. For instance, the continuative marker -zhe, combined with the 
question marker results in the form -zha, which is reminiscent of Mongolic languages of 
the region (85.8.2). Apart from =a, Wutun allegedly has borrowed the suffix -mu from a 
Mongolic source, probably Bonan (Sandman 2012: 384). But a Turkic origin of the Wu- 
tun form is more likely, e.g. Salar =mu, Uyghur =mu ($5.11). Consequently, it has been 
reanalyzed as enclitic here. According to Lee-Smith & Wurm (1996: 894) it has the form 
-me and is cognate with Mandarin ma, which seems unlikely but not impossible. As seen 
above, it has the form m»?* in Xining Mandarin. Wutun lacks a tonal distinction. 


(273) Wutun 


a. je ni-de X huaiqi hai-li=a? 
this 256-GEN book EQU-SEN.INF=Q 
‘Is this your book?’ 
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b. sama qe-lio-mu? 
food eat-PFv=Q 


‘Have you eaten the food?’ (Janhunen et al. 2008): 99, 100) 


The functional distribution of the two suffixes remains somewhat unclear, but =mu 
is said to have been encountered less frequently (Janhunen et al. 2008). Erika Sandman 
(p.c. 2016) informed me that =a is used with imperfective as well as progressive, and =mu 
with perfective as well as resultative aspect. Content questions remain unmarked. 


(274) Wutun 
ni ma-ge  nian-di-yek? 
2sG what-CLF read-PROGR-EGO 
"What are you reading?’ (Janhunen et al. 2008: 98) 


The following examples in (275) were kindly provided by Erika Sandman (p.c. 2016), 
see also Sandman (2016: 287-297). 


(275) Wutun 


a. ni xaitangwa hai-yek-mu ggaigan hai-yek? 
2sc student EQU-EGO=Q teacher EQU-EGO 
"Are you a student or a teacher?' 
b. zang jja-la-gu-la waiqai yek, mi-yek? 
PN visit-INCOMPL-COMP-COND hardship Ex NEG.EX-?EGO 
'If you visited Tibet, would there be any hardships?' 
c. ni yan za-de yek ya? 
28G tobacco smoke-N EX Q 
"You are a smoker, aren't you?’ 
d. nia-ha dun-li-a? 
2SG.OBL-TOP cold-sEN.INF-Q 


‘And how about you, are you feeling cold? 


Whether this last example (275d) can be analyzed as focus questions remains some- 
what dubious. Similar to Japanese (85.6.2) and Korean (85.7.2), it is perhaps better ana- 
lyzed as a polar question with an additional topic. Similar to Tibetic there seems to be 
interaction with egophoricity (§5.9.2.2). The egophoric marker -yek (« yek = Mandarin 
you 8) is “typically used with the first person in statements when the action is volitional 
and allows the speaker's control" (Sandman 2016: 209). There are certain other reasons 
in which the egophoric marker can be used for non-first person (Sandman 2016: 222), 


but none of them seems to apply in examples (274), (275a), or (276). 
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(276) Wutun 
a. ngu-de minze-li dongzhek sho-yek. 
1sG-GEN name-LOC PN say-EGO 
‘My name is Dongzhou? 
b. ni-de | minze-li ma  sho-yek. 
2SG-GEN name-Loc what say-EGO 
"What is your name?’ (Sandman 2016: 294f.) 


Thus, Wutun appears to follow the anticipation rule as described in §4.4 (Tournadre & 
LaPolla 2014: 245; Sandman 2016: 294). Sandman (2016: 226) mentions another interest- 
ing interaction with evidentiality: “The questions with factual evidential [re] differ from 
questions that are true requests for information. In my data, factual evidential was used 
in questions with an obvious answer”. 

Tangwang has also adopted the Uyghur polar question marker =mu and has unmarked 
content questions. Examples provided by Xu (2014) were given without tones. Lee-Smith 
(1996c) does not mention any questions. 


(277) Tangwang 

a. ni na tsua=mu? 
2sc be.able ?live=Q 
‘Can you live?’ (Xu 2014: 357) 

b. ni-m tcia.tsu tci xua jau-lr, ma-jou? 
2sG-PL family GEN life Ex-?IPFV NEG-?EX 
‘Does your family have a life (worth living) or not?’ 

c. ell Del su?*-xa a^ m me” -li? 
this-PL book-?ro» how sell-iPFv 
‘How does one sell these books?’ (Yibulaheimai A. 1985: 36) 


There are also very few recordings of questions in Gangou Chinese. Content ques- 
tions remain unmarked. Alternative questions appear to have just one marker on the 
first alternative. Polar questions most likely have the sentence-final marker ma, but no 
example was found. 


(278) Gangou 


a. aijie amen mei-lai shuo? 
3sc how NEG-come HS 


"Why didn't he come?' 
b. ni liang.ge huaer yao-li ma, bu-yao? 
2sG two flower want-FUTQ  neg-want 


‘Do you two want a flower or not?’ (Zhu Yongzhong et al. 1997: 447, 437) 
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5.9.2.2 Question marking in Tibetic and Qiangic 


Ebihara (2011: 64f.) mentions several different ways of forming questions in Amdo Ti- 
betan. Polar and content questions may optionally take a sentence-final clitic =ni. Its 
origin eludes me, but one might compare it with Hezhou/Linxia ni? WE. 


(279) Amdo Tibetan (Gonghe; Tibetic) 
a. teo wa jon=ni? 
2SG go.out.PRF come=Q 
“You came back?’ 
b. te'ozoka ma-jon=ni? 
why NEG-come-Q 
"Why did (you) not come?' (Ebihara 2011: 65) 


Content questions may also remain unmarked and polar questions have another en- 
clitic =na. The distribution of the two markers among polar questions remains unclear. 


(280) Amdo Tibetan (Gonghe; Tibetic) 
a. cho sta jon? 
2sc who coP.cj 
"Who are you?' 
b. cho demo jan=na? 
2sc fine COP.CJ=Q 
‘Are you alright?’ (a greeting) (Ebihara 2011: 65) 


Polar questions alternatively may be marked with intonation exclusively. There is yet 
another marking strategy for polar questions that is almost unique within the Northeast 
Asian area: Amdo Tibetan possesses a verbal prefix for marking polar questions. Again, 
there is no comment on the functional distribution of this marking strategy with respect 
to the others. But this might be the default marking. 


(281) a. Amdo Tibetan (Gonghe; Tibetic) 
te*o wol a-jan? 
2SG PN Q-COP.CJ 
‘Are you Tibetan?’ (Ebihara 2011: 70, 64) 
b. norba jog-dzi are? 
PN  come-N Q-COP.DJ 
"Will Norbu come?' 


(282) Amdo Tibetan (dPa’ris; Tibetic) 
ico: ` x"ic"a a-jol? 
2sG.DAT book Q-copP.cy 
‘Do you have a book?’ (Ebihara 2013: 155) 
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In the examples provided by Ebihara, the prefix always attaches to a copula, but it is 
not restricted to this context. Consider two examples from the Themchen dialect (283). 


(283) Amdo Tibetan (Themchen) 
a. Ten ts*unwa | a-jon? 
2sc merchant Q-COP.CJ 
"Are you a merchant?' 
b. bdogtcoal a-t'a? 
PN Q-go.PFV.DJ 


‘Has Dekyi gone?’ (Haller 2004: 69, 81) 


Janhunen (2012c: 184) is correct that the prefix represents an important difference of 
Amdo-Tibetan when compared with the other languages of the Amdo Sprachbund. But 
as seen before, Amdo Tibetan has sentence-final particles as well, and as we will see 
further below there are other languages in the region with a similar pattern. Basically, 
the same pattern as in the dialects mentioned above is also found in other dialects such 
as that of Tongren/Rebgong Amdo Tibetan, e.g. tc"o demo jin=na? ‘How are you?’ (see 
280b), and e-jol’ ‘Q-cop.cy’ (see 282) (de Roerich 1958: 98, modified transcription). The 
descriptions of Tibetic languages included here usually do not mention alternative, focus 
or tag questions. However, Tongren/Rebgong Amdo Tibetan has a tag question marker 
e-den-ga ‘Q-truth-?GEN’ (de Roerich 1958: 131, modified transcription). 

Amdo Tibetan has a distinction between conjunct and disjunct marking that usually 
is manifested in the copula system. The distinction has also been adopted from Tibetic 
by several Mongolic (85.8.2) and Sinitic (see above) languages of the Amdo Sprachbund 
(83.5). According to Aikhenvald (2012: 471) “the alternation between conjunct and dis- 
junct person marking marks new information and surprise, especially in 1st person con- 
texts. The disjunct person marking indicates something out of the speaker's control, un- 
expected and thus surprising? As we have already seen in $4.4, there is some interaction 
of conjunct/disjunct marking and questions: "In question sentences for the second per- 
son, the conjunct forms are generally used according to the point-of-view of the second 
person" (Ebihara 2011: 69). In Gonghe Amdo Tibetan, for example, there are special con- 
junct (jan, NEG man) and disjunct (re(I), NEG mare(l)) copula forms (Ebihara 2011: 69), see 
also examples (280,281, 282,283a) above. 

The intonation of Amdo Tibetan questions has been given in quite some detail by Sun 
(1986) for the dialect spoken in Xera village in Northern Sichuan: 


The interrogative word is spoken on a high falling pitch, or, if the interrogative 
word has more than one syllable, a high falling pitch on the last syllable and high 
level pitch on the other syllables. [...] The typical intonation of yes-no questions 
is a high level pitch on /v/ followed by a high falling tune realized on the verbal 
element. (Sun 1986: 60f.) 


What is given here as /v/ corresponds here to the question prefix a- in Gonghe and other 
dialects (cf. Sun 1993: 959). According to Denwood (1999: 128), Lhasa Tibetan «e» /?A/, 
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apparently a cognate of a-, generally has dubitative meaning but also functions as a polite 
question marker for the second person. 

Classical Tibetan has a sentence-final question marker ‘am that also assimilates to 
the preceding word. Content questions remain unmarked. 


(284) Tibetan (Classical; Tibetic) 

a. mi de dgra  yin-nam? 
person that enemy cop=Q 
‘Is that man an enemy?’ 

b. kyod kyis glang brnyas=sam? 
2sG ERG ox  borrow=Q 
‘Did you borrow (his) ox?’ 

c. nga-s khyod kyi khyo  ci.ltar sbyin? 
1sG-ERG 2856 GEN husband how give 
‘How can I give you back your husband?’ (DeLancey 2003: 262, 267) 


According to DeLancey (2003: 267), the Classical Tibetan sentence-final polar ques- 
tion marker ‘am “represents a reduction of an earlier balanced question construction, 
probably *V ‘o ma-V ‘V(or) not-V?’ > V ‘am VT Thus, the development is from a neg- 
ative alternative question construction to a polar question marker. Amdo =na seems to 
correspond to Classical Tibetan =nam (de Roerich 1958: 98). 

Zhongu (286), Baima (287), and Tangut (288) also have a verbal prefix for polar ques- 
tions and unmarked content questions. But Tangut, like Amdo Tibetan, also has a sen- 
tence-final question marker mo?, which formally looks similar to the marker in Xining 
Mandarin (284). 


(285) Tangut (Qiangic) 
nji? ká!  tíja! wj dzjo! moi? 
2SG.HON sugar cane PERF eat Q 


‘Do you eat sugar cane?’ (Gong Hwang-Cherng 2003: 614) 


The Baima question marker has been reanalyzed as a prefix here in analogy to the 
other languages. 


(286) Zhongu (?Tibetic) 
a. e-sa-ka? 
Q-sour-MIR 
‘Is it sour?’ 


b. ts'o(-se) gomo tf'atsa "dara? 
2sG(-DAT) money how.much Ex 


‘How much money do you have?’ (Sun 2003b: 826) 
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Baima (?Tibetic) 


a. tchoP? ko? ngeP re? e?-ndzi 


2PL PN Q-go ?PROGR 


53 i? ? 


‘Are you going to Baima?’ 
b. teho” su re? 
2sG who cop 
"Who are you?’ (Sun Hongkai et al. 1996: 131, 126) 
a. Tangut (Qiangic) 
ma! jij! yu! -ja-tchji!-dju!? 
sky GEN head g-?pot-have 
‘Does the sky have a head?’ (Jacques 2011: 427) 
b. thjij?sjo? thij? dzjwo? tjj — jij rjur-ar? mit njwi? jf? 
why this person alone Acc restrain NEG can COMP 


"Why can't you restrain this person alone?' (Gong Hwang-Cherng 2003: 
614) 


It seems possible that the preverbal question marker is an areal feature. Some other 
Qiangic languages share the same question marking strategy as well. Since all Qiangic 
languages today are located outside of NEA, some examples should suffice (289, 290, 


291). 


(289) 


(290) 


(291) 


Caodeng rGyalrong (Qiangic) 

ngji? ne-lna? ó-to?? 

2sG 2sG.Poss-child Q-exist 

‘Do you have children?’ (Sun 2003a: 498) 


Jiaomuzu rGyalrong (Qiangic) 
no tfha wuta a-ta-mut-w? 
2sc tea this Q-2-drink-2sc 


"Will you drink this tea?' (Prins 2017: 593) 
Guiqiong (Qiangic) 

zo gutchier e-dzi? 

3sG PN Q-COP 

‘Is he a Guiqiong?' (Jiang Li 2015: 304) 


It can also be found in further Amdo Tibetan dialects and other Tibetic languages of 
the region such as the gSerpa variety in northwestern Sichuan. 


(292) 


Amdo Tibetan (Xara; Tibetic) 

tc^o xabda a-s"op? 

2sG deer.chase Q-go.coM 

"Did you go deer-hunting?' (Sun 1993: 959) 
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(293) gSerpa Tibetan (Tibetic) 
martsa  həts*o-lə rd3avuia ytsa-ve ` o-yo? 
originally 2PL.family-DAT louse ^ one-except Q-coP 


‘So there was just one single ‘louse’ in your home?’ (Sun 2006: 125) 


Possibly, there are areal connections to Chinese dialects, too. Consider the following 
example from the Hefei dialect spoken in central Anhui province. 


(294) Chinese (Hefei) 
[ni ke-xiang.xin] 
2sc Q-believe 
‘Do you believe (that)?’ (Dexi 1985: 12) 


In this dialect the marker has the form k‘a?! or ko?!. An investigation of the extent of 
this feature towards the south goes beyond the possibilities of this study. But at least 
Mandarin as spoken in Yunnan also has this pattern. Independent of that question, it 
represents a southern border of the NEA area as no other language in the sample has a 
comparable pattern. It may also be noted that Qiang, the southern neighbor of Baima and 
Zhongu does not share this pattern (84.2.3, LaPolla & Huang Chenglong 2003: 180). The 
marker sometimes can exhibit a rather complex morphosyntactic behavior. For example, 
in Prinmi, a language also spoken in Yunnan, the marker has the form a and usually, but 
not always, takes penultimate position in a sentence (Ding 2014: 208). 


(295) Prinmi (Qiangic) 
a. po" pó. at=zel? 


uncle Q-EX.AN 
'Is Uncle home?' 
b. Viel dzi#=a"=fol? 
meal eat=Q=OPT 
"Will (you) have a meal?’ (Ding 2014: 209) 


Consequently, the marker can stand both in front of or after the verb. In Japhug (rGyal- 
rong, Qiangic), to mention yet another language from Sichuan with the feature in ques- 
tion, the prefix apparently invariably has the form ur- (and usually receives stress) (see 
Jacques 2004: 400f.). However, the form of the marker is sometimes variable. In Baima 
the question marker has several different variants shown in Table 5.99 that are deter- 
mined by the vowel of the following verb, i.e. it exhibits some form of umlaut. However, 
unlike Prinmi, there is no change in tone. 

Whether the markers in all languages mentioned above actually are cognates of each 
other could not be settled here but seems likely except for Chinese. However, this is 
irrelevant from an areal and typological perspective. 
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Table 5.99: Variants of the question marker in Baima (Sun Hongkai et al. 1996: 


85). 
Example Meaning 
e?-ndzi?? Q-go 
e? tfe Q-cut(.tree) 
-ne Q-sleep 
a?-dz3?4 Q-mend 
a?3-t5? Q-wrap 
55?-kho?? Q-back 
0?-phu? Q-rub 
g^ - py? o-smell 
3°>-khe™? Q-toast 
2??-ndzg? o-hit 
29-dzue?! Q-pick(.potatoe) 


5.9.2.3 Summary 


Table 5.100 summarizes the marking of polar and content questions in Trans-Himalayan 
languages located in Northeast Asia. Clearly, there is a tendency for marked polar ques- 
tions and unmarked content questions. 


Table 5.100: Polar and content question markers in Trans-Himalayan languages 
spoken in NEA. Tones are often variable and were thus excluded here. 


PO CQ 
Old Chinese *c*a [hu]# - 
Mandarin ma# -, ne# 
Urumqi Hui Chinese mast -, nost ~ ni# ~ mic# 
Xining Mandarin mo# -, le# 
Gangou ?mast $ 
Hezhou/Linxia ma#, la#, (ri)mu# -, Za#, rit, niza# 
Wutun PFV, RES =mu#, IPFV, PROGR =a# - 
Tangwang =mu# - 
Amdo Tibetan ə-v, =na#, =ni# -, =ni# 
Zhongu EM - 
Baima e-v (see Table 5.99) - 
Tangut ja-v, mos - 
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5.9.3 Interrogatives in Trans-Himalayan 
5.9.3.1 Interrogatives in Sinitic 


Sinitic, and particularly Mandarin, interrogatives are special in several regards. First, 
modern Mandarin interrogatives differ strongly from those in Old Chinese. Second, many 
of the forms are transparent formations that indicate a recent origin. Let us first address 
the Old Chinese interrogatives. Table 5.101 gives their recent reconstruction in the Baxter 
Sagart system. 


Table 5.101: Interrogatives in Old Chinese according to Baxter & Sagart; Baxter 
& Sagart’s (2014a; 2014b) reconstruction; Middle Chinese is only an approxima- 
tion; square brackets indicate uncertain sounds; forms marked by a question 
mark were not actually reconstructed as interrogatives by Baxter & Sagart (cf. 
Pulleyblank 1995: 91-97) 


Meaning Character Modern Reading Middle Chinese Old Chinese 


how d an 'an *[?]‘a[n] 

how Lj yàn jen "?a[n] 

what fn hé ha *[g]'aj 

what A hé hat *[g]'at 

(dial. for 

faf) 

why, what 3 xi hej *[g]'e 
howmany % (JL) ji kj+jX "ka 

how E qi khj+jX *C.qhaj? 
(rhetorical) 

who E shü dzyuwk *[d]uk 

who Ers shui dzywij *[d]uj 

who He chóu ?drjuw ?*[d]ru 

why, how 1H hu ?hu ?*[g]'a 

why HA hüwéi ?hu + hjwe ?*[g]'a + *c"(r)aj 
why not E hé ?hap ?*m-[k]'ap 

(< fef A?) 

how, where (F), E ` wü(hü), wi ?'uh,'u ?*?'aks, *[?]*a 


(< FMA?) 


Thus, Old Chinese may have had resonances in *?~, *d-, and *g*- as well as several 
other interrogatives without such a submorpheme (cf. Pulleyblank 1995: 91). Regardless 
of whether the pharyngealization hypothesis (indicated with /*/) turns out to be true 
(Baxter & Sagart 2014a: 68ff.), several forms qualify as K-interrogatives. Perhaps, the 
polar question marker *c'a * also belongs here. The exact analysis of most forms is 
unclear. But note that at least in some interrogatives analyzable morphological elements 
may have been present. The difference between "[d]uk $ and "[d]uj #£ is especially 
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intriguing. According to Pulleyblank (1995: 92) the former belongs to “a group of words 
in *-k [...], which are confined to preverbal position referring to the subject, and which 
usually select the subject from a larger group.” Xu (2006: 236ff.) agrees in part with this 
assessment and argues that a group of words ending in *-j (such as "[d]uj ##) are more 
flexible in their syntactic behavior than those in *-k (such as "[d]uk 74). Pulleyblank 
(1995: 91) furthermore assumes that several interrogatives including dn % and yan #5 
are derived from the "coverb" (perhaps better called preposition) yú D: in combination 
with unspecified other elements. In fact, Baxter and Sagart reconstruct the form as "[?]a 

^. which makes a connection with *[?]'a[n] % and *?a[n] SS seem possible. But if this 
assumption is true, the preposition must have fused with a following interrogative. These 
approaches are far from offering a clear picture of the etymology or morphology of Old 
Chinese interrogatives. To track the development of interrogatives—or of questions in 
general for that matter—goes well beyond the possibilities of this study (but see Peyraube 
& Wu 2005). 

Colloquial Mandarin Chinese (Table 5.102) potentially has only one interrogative that 
is synchronically non-analyzable, namely shéi (shui) ‘who’. All other interrogatives are 
analyzable to different degrees. Some are straightforward combinations of an interrog- 
ative and a noun such as shénme difang ‘where’. The second part simply means ‘place’, 
but the first element shénme ‘what’, like zénme ‘how’ possibly contains a suffix -me with 
an opaque meaning. In the complex interrogative zén(me)-ydng ‘how, what kind of, in 


what way’, the element -me may be omitted, which speaks in favor of an analysis as a suf- 
fix. Other interrogatives productively combine with grammatical elements such as the 
classifier ge “+. A special case is the interrogative gàn.má ‘to do what, why’, which quite 
clearly is a contraction of the transparent formation gan shénme ‘to do what’. The first 
element gån ‘to do’ remains transparent, but the second element má is what is usually 
called a cranberry morph, because it is not attested outside of this word. The interroga- 
tives ji- ‘how many’ or ná- ‘which (one) do not qualify as “basic question words" either, 
because they necessarily combine with another element such as a classifier. The lack of 
a strongly developed resonance speaks in favor of a relatively new system of interroga- 
tives. In fact, only shéi (shui) WÈ and jï- JL can be traced back to Old Chinese. 

However, apart from the interrogatives mentioned in Table 5.102, Mandarin has about 
a dozen or so formal interrogatives given in Table 5.103 that are mostly restricted to the 
literary language and preserves Old Chinese *[g]'aj {*.74 

To my knowledge, in NEA only Mandarin has such a marked contrast between two 
different sets of interrogatives that depend on style. 

Mandarin interrogatives display strong paradigmatic similarities with the demonstra- 
tives. Mandarin is not usually analyzed as having paradigms, but nevertheless such an 
analysis seems viable (Table 5.104). 


Ditt may be noted that Japanese also preserves the character fi] but has an autochthonous pronunciation 
nani DLC instead. 
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Table 5.102: Mandarin Chinese interrogatives and their analysis (mostly based 
on own knowledge, elicitation, Ross & Ma 2006: 160f.); not all combinations 


are shown 

Meaning Character Form Analysis 

how + ADJ EA duo + ADJ < ‘very’ < ‘much’ 

e.g., how long £k duó-cháng cháng 'long' 

how many/much Z duó-sháo duo ‘much’, sháo ‘few’ 

how + ADJ if háo- * ADJ « ‘very’ « ‘good’, dialectal 
variant of duo 

how many/much JL ji- + CLF usually ji-ge JL 

at what time JV b) ji-dian(zhong) dián(zhong) ‘o’clock’ 

which (one) ER na- + CLF usually nd-ge WA 

where WSL na-li/-(e)r/-bian ` -li/-(e)r/-bian 

who pi shéi (shuí) 

what FA shén-me -me as in zěn-me?, 
<-n-m-> = [-mm-] 

what ri shá colloquial variant of shénme 

what Vi ma colloquial variant of shénme 

where TAHA shénme difang  difang ‘place’ 

when TAHR shénme shíhou shíhou ‘time’ 

why HIFA wèi-shénme wei ‘for’ 

to do what, why FIA gàn shénme gàn 'to do' 

todo what, why = i gan.ma colloquial variant of gan 
shénme 

on what basis FATA ping-shénme ping ‘rely on’ 

how, why EA zěnme -me as in shén-me?, 
<-n-m-> = [-mm-] 

how WE zá colloquial variant of zénme 

how E zén(me)-yang yang ‘kind’ 
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Table 5.103: Formal or literary Mandarin Chinese interrogatives (Ross & Ma 


2006: 161f.; Pulleyblank 1995, partly elicited); not all forms listed 


Meaning Character Form Base Meaning 
why A hé-bi bi necessarily 
why not HY AN hé-bà bü not 
when (rhetorical) Ay hé-céng céng once 
how could (one) not {7% hé-chang chang once 
why not A hé-fang fang impede 
why HA hé-gü gü reason 
when Eig hé-shí shí time 
what is Hf hé-wéi wéi COP 

how, why al E hé-yi yi with, use 
how Amt rü-hé rü to be like 
why AYA wei-hé wei for 


Table 5.104: Partial demonstrative and interrogative paradigms in Mandarin 


(my knowledge) 

this that which 
plain zhe- na- na- 
X-PL zhé-xie na-xié na-xié 
X(-NUM)-CLF zhé(-yi)-ge na(-yi)-ge na(-yi)-ge 
id. (fused) zhéi-ge néi-ge néi-ge 
X-LOC zhé-(e)r nà-(e)r ná-(e)r 
X-LOC zhe-li na-li na-li 
X-LOC zhé-bian nà-bian ná-bian 
DIR-X-LOC wang-zhe-(e)r wang-na-(e)r wang-na-(e)r 
ALL-X-LOC dao-zhé-(e)r dao-na-(e)r dao-na-(e)r 
ABL-X-LOC cóng-zhé-(e)r cóng-nà-(e)r cóng-ná-(e)r 
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The interrogative duo is usually used with scalar adjectives such as duo-jiü & A ‘how 
long’. Strangely, as my informant tells me, some dialects such as those of Guiyang use the 
cognate of Mandarin hdo D instead. Literally, duo & means ‘much’ and háo #¥ ‘good’, 
but notably both share an emphatic meaning of ‘very’. Sentences with both duo and 
háo can have their original meaning and may be regarded as polysemous. The reading 
depends on the intonation as well as the context. The following elicited example has 
been idealized and standardized to allow a better comparison. 


(296) Mandarin 
zhé tiáo hé | duo/háo kuan? 
this CLF river how broad 


‘How broad is this river?’ 


Guiyang is located outside of NEA, but the same phenomenon can also be observed, for 
example, in the Shiquan dialect in Shaanxi, which has the form xao? lif (e.g., xao? teiu® 
TFA ‘how long’). 

There are more descriptions of interrogatives in Chinese dialects than can possibly be 
mentioned here. However, the majority simply rely on a transliteration with characters 
and do not give a phonetic transcription, which makes the data problematic at best. The 
following gives the interrogatives from a selection of different dialects, namely Suide 
2% $8 (northern Shaanxi), Shiquan D P (southern Shaanxi), Yanggao HI i (northern 
Shanxi), Lingshi RA (eastern Shanxi), and Xining V8 T^ (eastern Qinghai). In addition, 
the interrogatives of Hui Chinese spoken in Urumqi 7X37 (northern Xinjiang) are 
given. The list is not meant to provide an exhaustive overview, but gives an impression 
of dialectal variation found in northern Mandarin (Table 5.105). 

There is a bewildering variety of different forms and combinations of forms that is 


qualitatively different from most other interrogative systems observed in NEA. Often a 
specific function may be expressed with a wide variety of different forms. For instance, 
the Yanggao dialect is said to have twelve locative forms. Only a selection of forms is 
included here. The Suide, Yanggao, and Lingshi dialects represent the disputed Jin dialect 
area that is sometimes distinguished from Mandarin. An interesting feature shared by 
these dialects is a final glottal stop such as in the classifier ^ (Mandarin ge, Shiquan go, 
Xining ko, Urumqi Hui Chinese ky, but Suide kua?, Yanggao ka?, and Lingshi ka?, here 
given without tones). Many forms that were not listed cannot be found in Standard Man- 
darin. For example, Mandarin cannot use the plural marker -men fî] (Lingshi su** man", 


Xining fei?*?! m3 7494, Urumqi seit man”, Yanggao suei*!? mar!) or the classifier (yi)ge — 


^ ‘one cLF' in combination with the personal interrogative shéi ‘who’ (Urumqi sei?! ky” 


~ sei?^ ji?! ky?!), 

A special case is Lingshi ue**su*4 TUE ‘who’, which contains a demonstrative ue 
unknown in Mandarin. Whether Xining a^ mo ?*** ‘why, how’ is related to Mandarin 
zénme Ę A ‘why, how’ or na D ‘which’ remains somewhat unclear. However, it has 
clear parallels in Hezhou, Wutun, Tangwang, and Gangou Chinese. 


44 
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Table 5.105: A selection of interrogatives from Suide (northern Shaanxi, 
Weiqiang & Yongliang 2013: 51, slightly corrected), Shiquan (southern Shaanxi, 
Shi Feng 2009: 14), Yanggao (northern Shanxi, Sun Qinglin 2015: 150, vowel 
transcription slightly corrected), Xining dialects (eastern Qinghai, Zhang 
Chengzai 1980: passim), and Urumqi Hui Chinese (Liu Liji 1989: 160f., passim) 


Mean. Suide Yanggao  Lingshi Shiquan Xining Urumqi 
who suei?? sue? su ^^ fei?* sei ka? 
which, nai? ? ia” la” go?! att kə?’ na?! ky”? 
who ko??? ko?^4 
what sa??-ma, sa? sop? ga?” sa?” sms), 
so? sa24 
which La? na? Pia? la? 
where la?’ (4i) na? le? la Di ner? 
where la?! -ter ` na? tor?! latag?! ^ 4194535 
eu 
when sor? sor? 
xur xur^ 
to^ tuy?! tuy?! 
ur xuər?t xuor^ 
Leif? ei tei”? tei” et! 
sa??? 
e sai? ail sa?” 
xyor?4 (xou?) s» 
xuj 2254 
how tsua?” reali? tsa ^ti ^4 291421. tsa??- 
tsa?4 
why vei’ (ian?!) ig?» uei?! uei?P? 
san’, vef A sað!  uej5? ga? eet 
vei?! sor? 
sa??-ma 
how tan’? tuy xao? tus tuy? 
how to? tuy?! tei??? tu?4 $37 tuy?!g5?? 
much sl sou? s321? 
how tgi?! tei? tei?! tei 
many 
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Table 5.106: Xunhua and Jishishan Hezhou interrogatives (Zhong Jinwen 2007: 


393f.); forms in square brackets from Dwyer (1995: 156) 


Meaning Xunhua Jishishan 
who, which a9 k355 [a-zi?4-go ~ a-ji?*-go] ^ a ffj?kof* 
where ah? a^ n^n? 
how a?! mu? liour? a^ mu^ liao?? 
what sima? S? ma* 
what [sa?] 

what tgo??liour 

what (emph.) [so ? ma?! ka] 

how much/ many t5? ^g? tuo?!sao f^ 
why tsa?liou sg? a^^mu* 
where [a^-dr-r] 

(?Turkic -DA) 

why [aP-men?*dza] 


For Hezhou dialect only a few descriptions of interrogatives are available (Table 5.106). 
Both varieties of Hezhou listed have lost the initial nasal in the cognate of Mandarin ná 
WU and contain some etymologically opaque derivations. 

Rimsky-Korsakoff (1994: 513, 515) mentions the two Dungan forms dza E and sa ls. 
Hai Feng (2002: 76) also only lists tci?! s?^ JLH} ‘when’ and sol! EE ‘what’, but includes a 
notation of tones. We have already encountered all these forms in several dialects above. 

Despite the fact that some Wutun interrogatives are cognates with Mandarin, the 
overall picture is quite different. There are two new resonances being built up (a- and 
ma-), neither of which exists in Mandarin. Wutun has only one basic interrogative word, 
ma ‘what’, which possibly is a contracted form of Mandarin shénme ÍT 4 > má fii ‘what’. 
A combination of shénme fT 44 with ge * as in Wutun ma-ge is impossible in Mandarin, 
but not in the Xunhua subdialect of Hezhou, which has s2? ma*!ka fT A S. The develop- 
ment of the meaning of Wutun a-ge from ‘which one’ to ‘who’ has parallels in several 
Mandarin dialects. Interrogatives in Tangwang are relatively straightforward. Only the 
origin of what appears to be a locative suffix -tha remains unclear for now. The descrip- 
tion does not seem to be very reliable as individual interrogatives are given in different 
forms throughout the book (Xu 2014). Only for some forms tones were given, which is 
why they have been removed altogether. Some unclear forms were left aside. 

There are no good descriptions of questions in Gangou Chinese. Only in the last two 
years have there been any studies of the language in China at all. But they are all from 
one and the same scholar Yang Yonglong (e.g., 2014: 244f.), who does not give sufficient 
information on pronunciation or grammar and for the most part employs Chinese char- 
acters for transcription and dialect for translation. According to Yang Yonglong (2014), 
the interrogatives can have a plural marker [-men] f|] that is said to be pronounced /mu/. 


280 


5.9 Trans-Himalayan 


Table 5.107: Tangwang (Xu 2014: 222, passim), Wutun (Janhunen et al. 2008: 
69f.; Erika Sandman p.c. 2016), and Gangou interrogatives (Zhu Yongzhong et 
al.1997: 447; Yang Yonglong 2014: 244f. with Mandarin transliteration in square 


brackets) 
Meaning Tangwang Wutun Gangou 
who a-ke a-ge [age] 
where ah a-li 
where a-tha a-ra/la [a-biao] 
when a-Xxul 
how, what kind of am(tse) 
why, how amutse a-menzai a-men 
what kind of amutgrko 
what sama ma(-ge) [sha] 
when soma g|xou 
how much tuago do 
when do-xige [a-hui(e)r] 
how many jhi-ge 
why ma-shema 


5.9.3.2 Interrogatives in Tibetic and Qiangic 


Table 5.108 and Table 5.109 give some interrogatives from seven different Tibetic or 
Qiangic varieties, heuristically classified into those with and without tones. 

For the gSerpa dialect from northern Sichuan, Sun (2006) only mentions the interrog- 
ative tc"o ‘what’. This interrogative stem, present in all varieties mentioned here, has 
been reconstructed as ("tyi >) "te(h)i ‘what’ for Proto-Tibetic, showing palatalization 
characteristic for Tibetic (Tournadre 2014: 114). The derived form, e.g. t/^a-tso in Zhongu, 
according to Sun (2003b: 831), has the underlying Written Tibetan form ci.cig, in which 
the second element seems to be the indefinite article cig, derived from the numeral gcig 
‘one’ (DeLancey 2003: 263). A parallel can be found in some Mongolic languages of the 
Amdo Sprachbund (85.7.3). 

The plural may be formed by reduplication, which has been adopted by some Mongolic 
languages of the region (85.8.3), e.g. Gonghe Amdo Tibetan s" s*a ‘who (PLY (Ebihara 
2011: 54) or Baima su” su? ‘who (PLY (Sun Hongkai et al. 1996: 78). But in Themchen 
Amdo, for example, there are the plural forms s"a-te"u and kar-tc^u, instead. The lan- 
guages share stems for ‘who’, ‘which’, ‘when’, and ‘what’, the last of which is the basis 
for several derivations. For instance, Gonghe Amdo Tibetan tc*a-zek-a contains a dative 
case that has the form -(k)a following k (Ebihara 2011: 60). This seems to be an exact paral- 
lel to Themchen tc"2-zoc-a, Zhongu t/"á-tsa-jo (Sun 2003b: 797, fn. 51), and possibly Cone 
tc^3H -z3-- yel. There are parallel formations in some Mongolic languages of the region 
(85.8.3). The Gonghe interrogative tc"a-gi ‘how’ as well as its Ihemchen cognate te”a-yi/- 
ji apparently contain a purposive or causative conjunction (Ebihara 2011: 71). Given the 
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Table 5.108: Gonghe (Ebihara 2011: 64), Themchen (Haller 2004: 51, 55, 66, pas- 
sim), rNgawa (Suzuki 2006: 82), and Zhongu interrogatives (Sun 2003b: passim) 


Form Amdo Amdo Amdo Zhongu 
(Gonghe) (Themchen) (rNgawa) 
who s"o Bei s"u sə 
which kay kan ken ko-te 
where kan=na kay-na, kor-tui ko-no 
tc"o-na 
how many/much to, to, tc^i-mo-zo, tf"á-tso 
tc'o-mo-zek teə-mu-(-zəç) _kante"a"ta, 
dogqrite*s 
what te"a-zek te"a(-zac) tea-za tf"o(-tsa) 
why, how te"s-zek-a tc"o-zac-a kan-tur tf^á-tsa-jo 
how tc"o-gi tc'"o-yi/-ji 
when ?nem nam nam, no 
kor-tu 
Table 5.109: gSerpa Tibetan (Nagano 1980: passim), Cone Tibetan (Jacques 2014: 
passim), and Baima interrogatives (Sun Hongkai et al. 1996: 77ff. 348f., passim) 
(HILIN -= high/low/neutral tone) 
Form gSerpa Cone Baima 
who suH su? ~ su? 
which gan'ndiN ka" le”? 
where goneF kaal nai kala” 
how many/much 26H ra?H. éiNre?N tc'onl-dzeH tfh? z335 ; 
kako”, 
tfh z3 
what WC ha? 
how Gru. tc^aH- zal yel kaPtso*?? 
why, how elef ei toi dai. — te3H- ytsa- 
chaflivininan! 
when nant nælwõðl ndzo'ye" ` no?ndza? 
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analyzability of many of the forms, for example those starting with te”- in Amdo Tibetan, 
there is no clear resonance phenomenon in any of the languages mentioned. Apart from 
the stem kan ‘where’, Ebihara (2009: 165) also mentions the form kar-ya, in which the 
dative takes the regular form -ņa following n (Ebihara 2011: 60). Ebihara (2013: 157) also 
noted an interesting difference in forms meaning ^whence' with either the ablative or 
the genitive in different dialects of Amdo Tibetan. 


(297) Amdo Tibetan (rGya ya) 
cho kag-ne ` jorn-ne? 
2sc where-ABL come-AUX 


"Where are you from?' 


(298) Amdo Tibetan (dPa'ris) 
tc'o kag-ngo ` ri-le? 
2sG where=GEN come-AUX 


"Where are you from?’ (Ebihara 2013: 157) 
There are parallel interrogative and demonstrative paradigms with a three way con- 


trast similar to Japonic and Koreanic. Table 5.110 illustrates these with data from the 
Themchen dialect. Only the distal demonstrative shows an exact parallel. 


Table 5.110: Demonstrative and interrogative paradigms in Themchen Amdo 
Tibetan (Haller 2004: 51, 64, 66) 


PROX (speaker) prox (hearer) DIST which 
DAT nda ta kan-a kan-a 
LOC nda-na ta-na kan-na kan-na 
ABL nda-ni ta-ni kan-ni kan-ni 


Finally, let us have a brief look at the interrogatives from the extinct language Tangut. 
Gong Hwang-Cherng (2003: 617, passim) mentions sjwi!, sjwi? ‘who’, ljo? ‘where’, lji! 
‘which’ , wa? ‘what’ , wa? zjij! ‘how many/much’, zjij!-ma* ‘how many kinds’, and thjij? 
(sjo?) ‘why, how’ (1 = level tone, 2 = rising tone). The forms are very different from Ti- 
betic and even Qiang (LaPolla & Huang Chenglong 2003: 53), which indicates a relatively 
long time of separation (see also Chirkova 2012). However, the Qiang interrogative sys- 
tem is relatively innovative with many forms, as in the Tibetic varieties above, being 
based on ni(y)i ‘what’. The Qiangic personal interrogatives seem to be among the most 
conservative (e.g., Qiang so, Guiqiong su etc.) and are probably cognates of the Tangut 
and Tibetic forms above. 
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5.10 Tungusic 


5.10.1 Classification of Tungusic 


Recently, Janhunen (2012d: 16) suggested the following classification of Tungusic that 
is based mostly on previous work done by other researchers (Ikegami 1974; Lie 1978; 
Doerfer 1978a; Georg 2004). The designations of languages has been slightly adapted. 
The primary split in Tungusic is between northern and southern Tungusic, both of which 
subsequently went through a secondary split. Thus, there are four main branches that, 
following Janhunen, may be called Ewenic, Udegheic, Nanaic, and Jurchenic. There are, 
however, several minor problems with Janhunen’s classification. For example, it does 
not show the strong dialectal division of some of the languages. Even and Evenki, for 
instance, are said to have about 12 and 50 dialects, respectively (e.g., Malchukov 1995; 
Atknine 1997). The classification of Solon into three different languages is too detailed, 
whereas the dialects of Oroqen are not even mentioned (e.g., Whaley & Li 2000, see 
Figure 5.5). 

While the dialectal divison of Nanaic, Udegheic, and Ewenic is rather well understood, 
there is almost no attempt at a classification of Jurchenic. The Jurchenic branch has been 
named after Jurchen, the oldest attested Tungusic language. Several scholars have tried 
to give an adequate account of the relation of Jurchen and Manchu, the second oldest 
attested Tungusic language. Janhunen (2012d: 6) claims that, despite “slight variation in 
the dialectal basis”, the three Jurchenic languages Jurchen, Manchu, and Sibe “may be 
classified as a diachronic sequence of a single language”. However, even if we consider 
Jurchen an archaic form of Manchu as does Janhunen, apparently following Doerfer 
(1978a: 12), this is imprecise and somewhat misleading. Doerfer’s classification, of course, 
was written before the bulk of information necessary became available during the 1980s, 
when mainly Chinese linguists started to produce grammatical descriptions of Jurchenic 
varieties. Except for Sibe, these have mostly been neglected in western descriptions. I 
tentatively propose a new classification of Jurchenic (e.g., Hólzl 2017b; 2018a, Figure 5.6). 

The exact branching structure, especially the precise relation of the three hypotheti- 
cal branches, has yet to be investigated. Until recently, Alchuka and Bala were almost 
unknown in the West (e.g., Mu Yejun 1985; 1986; 1987; Ikegami 1999 [1993]; Hölzl 2014a: 
212; Hólzl 2015a: 136 fn. 27; Hólzl 2017b; 2018a,a). Bala is basically Jurchenic but exhibits 
some influence from several other Tungusic languages (Mu Yejun 1985; 1986). Alchuka 
preserves some archaic features (e.g., an initial k-, and a verbal suffix -zï < *-si, Hölzl 
2017b; 20182), but also has unique innovations (such as the loss of several word inter- 
nal consonants) and displays some interference from Manchuic. The existence of two 
distinct Jurchen languages has also been recognized by Kiyose (2000). They have been 
called Jurchen A (Bureau of Translators, Kiyose 1977) and Jurchen B (Bureau of Inter- 
preters, Kane 1989) in analogy with similar cases, such as Tocharian A and B. Given 
that Sibe, located in Dzungaria since 1764, has been relatively isolated for over two hun- 
dred years and was strongly influenced by Khorchin Mongolian before that, it has to be 
kept apart from those dialects still located in Manchuria (e.g., Aihui, Lalin/Jing, Sanjiazi, 
Yibuqi). All modern varieties, except Bala and Alchuka, may be classified as Manchu di- 
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Figure 5.5: Classification of Tungusic 
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— Written Manchu (since 1600) — Written Sibe 


— Manchuic + —————— ‘Jurchen B (ca. 1500) 


L — —— Manchu dialects + (Manchurian dialects 


— Dzungarian dialect(s) (Spoken Sibe) 
— Written Jurchen (12'-15" c.) 


—- Balaic "Jurchen A (ca. 15'^ c.) 


*Bala dialects 


L- Alchukaic — *Alchuka (sociolects?) 


° = only historically attested = no speaker left t) = moribund, almost extinct 


Figure 5.6: Proposed new classification of Jurchenic 


alects that, together with Written Manchu and Jurchen B, form the Manchuic branch of 
Jurchenic. Bala, together with Jurchen A (Mu Yejun 1987 also saw this connection), form 
a branch on their own called Balaic. The distinction between Jurchen A and Written Ju- 
rchen is mostly heuristic in nature. Technically speaking, if the above classification is 
correct, the forerunner of Alchuka might be called “Jurchen C” but does not seem to be 
attested. Only the somewhat mysterious language of the Kyakala in China (kiyakara in 
Manchu) had to be excluded for lack of data, but it seems to be a mixture of different 
Jurchenic varieties as well as, perhaps, some other Tungusic languages (see Hólzl 2018b 
for details). 


5.10.2 Question marking in Tungusic 


In Evenki, there are two ways of expressing polar questions. The first relies on a change 
of intonation: “The focus, as a rule, attracts the intonational nucleus on to itself, the 
intonational contour being higher and more prolonged than that of the corresponding 
positive sentence” (Nedjalkov 1997: 4f.). The following example can mean both “They 
killed the elk’ (ty being the intonational nucleus) and ‘Did they kill the elk?’ (with the 
tone peak on tyva vaa). 


(299) Evenki 
nunartyn moty-va vaa-re-Q. 
3PL elk-acc kill-NoN.FUT-(3PL) 


‘They killed the elk./Did they kill the elk?’ (Nedjalkov 1997: 4f.) 
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The second way of expressing a polar question makes use of an enclitic =Ku that can 
have several variants depending on the preceding sounds, =gu, =ku, =nu, and =vu. The 
enclitic attaches to the verb in polar questions, to the focused element in focus questions, 
and occurs twice in alternative questions. 


(300) Evenki 


a. si ulle-ve | d'ep-che-s-ku? 
2SG meat-Acc eat-PST-2SG=Q 
‘Have you eaten the meat?’ 
b. si ulle-ve=gu d’ep-che-s? 
2SG meat-ACC-Q eat-PST-2SG 
‘Did you eat meat/the meat?’ 
c. tar asatkan soro-d'oro-n- gu, in’ekte-d’ere-n=nu? 
that girl cry-PRs-3sG=Q  laugh-Pns-3sc-Q 
‘Is that girl crying or laughing?’ (Nedjalkov 1997: 136, 7) 


This question marker can be traced back to Proto-Tungusic (Benzing 1956: 147) and ex- 
hibits formal and functional similartities to the Mongolic question marker (85.8.2), which 
might indicate an old loan relationship of unclear direction. Buryat =gii and Khamnigan 
Mongol =gv might be relatively recent loans from Evenki. 

Since Tungusic has different negators depending on the clause type and other factors 
(Hólzl 2015a), negative alternative questions show different patterns as well. For standard 
negation, many Tungusic languages employ a negative verb. The question marker that 
is found once in polar but twice in alternative questions attaches to the first alternative 
and to the conjugated negative verb while the rest of the second alternative, including 
the lexical verb, is deleted. Below we will encounter negative alternative questions with 
other negators. 


(301) Evenki 
eme-d'e-n-gu ele tar asi, e-te-n-gu? 
come-FUT-3sc-Q here that woman NEG-FUT-3SG-Q 


‘I wonder if that woman will come here or not. (Nedjalkov 1997: 7) 
Content questions do not show the enclitic and remain unmarked. 


(302) Evenki 
ekun-duk eme-che-s? 
what-ABL come-PST-2sG 


"Where did you come from?' (Nedjalkov 1997: 3) 


Focus questions may also remain unmarked morphosytactically, in which case the 
focused element seems to take second position (cf. Nedjalkov 1997: 135). 
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(303) Evenki 
a. si [er dukuvun-ma] ga-cha-s? 
2sG this book-Acc  buy-Psr-2sc 
‘Did you buy this book?’ 
b. [er dukuvun-ma]si ga-cha-s? 
this book-Acc 2sG buy-PsT-2sG 
‘Did you buy this book?’ (Nedjalkov 1997: 5) 


Question marking in Evenki dialects does not appear to differ much from Standard 
Evenki. The following examples were drawn from the Sakhalin dialect that belongs to 
the eastern group of dialects (Atknine 1997). In this dialect the enclitic has a long vowel 
but displays the same semantic scope and morphosyntactic behavior. 


(304) Evenki (Sakhalin) 

a. kilivliil, saa-ra-s ee-ra-s=kuu? 
girl know-AOR-2PL NEG-AOR-2PL=Q 
‘Girls, have you noticed?’ 

b. ile-ve=guu e-chee-s xenu-re? 
man-ACC-Q NEG-PST-2SG notice-CN 
"Haven't you noticed the man?’ 

c. nene-neet-y-vun=uu e-neet-y-vun-guu? 
g£O-DEB-E-]PL.EX-Q | NEG-DEB-E-1PL.EX=Q 
‘Should we go or not?’ 


d. eedaa ekeendeek-lee-vun eme-nni? 
why  dancing.place-Loc-1PL.EX.POss come-2sc 


"Why have you come to our dancing place?' (Bulatova & Cotrozzi 2004: 64, 
72, 19, 12) 


There is also an enclitic =too of unclear origin that marks polar questions and does 
not seem to exist in Standard Evenki (Nedjalkov 1997). 


(305) Evenki (Sakhalin) 
ta.r-gachiin beje-nelii kuxii-neet-y-c=too? 
that-EVAL man-com fight-DEB-E-2sG=Q 


"You are supposed to fight with such a man?’ (Bulatova & Cotrozzi 2004: 19) 


Khamnigan Evenki preserves the original enclitic as =gv but differs from Evenki in 
having borrowed both Russian =li (§5.5.2.2), as well as the corrogative marker bei from 
Khamnigan Mongol (85.8.2). In Mongolic the marker is derived from the copula and even 
in Khamnigan Evenki seems to be mutually exclusive with the autochthonous copula bi- 
. The fact that Khamnigan Evenki =gv does not show a variation, as in Evenki, may 
indicate influence from Khamnigan Mongol. 
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(306) Evenki (Khamnigan) 


a. aya bi-si-ndi-gv? 


good be-PRS-2SG=Q 


"Are you well?' 


. &r-si nii bei? 


this-2sc.Poss who Q 
"Who is this (one of yours)?' 


. sii nii bi-si-ndi? 


2sc who be-PRs-2sG 
"Who are you?' 


. murin-li, vkvr-li? 


horse=Q cow=Q 


‘(Is it) a horse or a cow?’ (Janhunen 1991: 95f.) 


5.10 Tungusic 


In Even the enclitic has the variants =gu, =ku, and =nu (Malchukov 1995: 19) and as 
in Evenki marks polar, focus, and alternative questions. Content questions remain un- 


marked. 


(307) Even 


(308) 


a. i-d’i-m=gu? 


enter-FUT-1sc-Q 
‘Shall I come in?’ (Malchukov 2001: 165) 


. tiniv=gu em-ri-n? 


yesterday=Q come-PsT-3sc 


"Did he come yesterday?' (Andrej Malchukov p.c. 2013) 


. uliki-w-gu bu-ri-s, hulica-m-gu? 


squirrel-acc=Q give-PsT-2sc fox-acc=Q 


‘Did you give (him/her) a squirrel or a fox?’ (Benzing 1955: 111) 
Even 

etiken ` Ge hor-re-n? 

old man which-ArL go-NON.FUT-3sG 
"Where has the old man gone?' (Malchukov 1995: 19) 


Even has a further question marker -i (Malchukov 2008: 138) with possible parallels in 
Negidal, Solon, and, less likely, Uilta. Dialects of Even show basically the same question 
marking patterns. Consider the following examples from the western dialect area. 


(309) Even (Western) 


a. ta.wa.r!e.k bi-d'i-n? 


that what cop-FUT-3sG 
‘What is it?’ (said in riddles) 
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b. oldo-ur oldo-mi-s=gur? 
fish-acc fish-v-2sG=Q 


‘Do you catch fish?’ (Sotavalta 1978: 30, 28, simplified) 


For the eastern dialect area (from the river Anadyr), Schiefner (1874: 217) has an ex- 
ample of an alternative question without a question marker but with what appears to 
be a disjunction tómi. Given that disjunctions are very rare in the northern part of NEA 
(86.4), but also exist in Kolyma Yukaghir (85.14.2), an areal connection seems possible. 
Malchukov (2001: 179) argues that imperative sentences in Even “may be used in inter- 
rogative sentences to ask for permission". 


(310) Even 
kosci-de-ku? 
fetch-FUT.IMP-1sG 
‘Shall I fetch (the reindeer)?’ (Malchukov 2001: 165) 


This might indicate a certain connection to the Chukotko-Kamchatkan languages in 
which there is a general affinity of imperatives to question marking (§5.3.2). 

Matić (2016) claims that Even has a special category of tag questions that developed 
out of the negative verb. 


(311) Even (Tompo) 
adzit=ta, ta-la ^ ih-ha-p e-he-p? 
truth=?and that-ALL arrive-NFUT-1PL NEG-NFUT-1PL 


‘And indeed, we have arrived there, haven't we?’ (Matić 2016: 171) 


It may be noted that the construction actually has the form of an elliptical negative 
alternative question (or not?), but with juxtaposition instead of double marking. This 
certainly explains the fact that, as in many other examples from Tungusic languages, the 
negative verb takes the same suffixes as the lexical verb. An interesting phenomenon is 
the optional presence of a contrastive or adversative enclitic C=kA ~ V -kkA that precedes 
the negative verb, but is not restricted to questions (Benzing 1955: 112). 


(312) Even (Tompo) 
hii-kke | e-he-ndi e.re.k kunaa-w | Corda-ndi? 
2SG=CONTR NEG-NFUT-2sG this child-Acc beat-2sc 
"You beat up this child, didn't you?’ (Matić 2016: 172) 


The word order in this last example (312) is indeed problematic for the analysis as alter- 
native question and strongly speaks in favor of Matic’s (2016) analysis, although there 
are other examples with relatively free word order above (e.g., 307). 

For Arman, Doerfer & Knüppel (2013)—the only source readily available—do not have 
examples for any question type. Given its very close relation to Even, we may speculate 
that the marking of questions was similar. However, several interrogatives are attested 
and will be presented in §5.10.3. 
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Questions in Orogen are usually marked with an enclitic that has the variants =nee ~ 
=nEE after nasals and =jee ~ =j£E in all other positions (Hu Zengyi 2001: 157). It cannot be 
cognate with Evenki =Ku which exists in Oroqen as well. Most likely it has a connection 
to =yee in the Mongolic language Dagur (but see Whaley 2005). The enclitic marks polar 
and alternative questions. No instance has been found where it marks focus questions. 


(313)  Oroqen (Nanmu) 
a. tari naan nammu-ni boi-pee? 
3sc also PN-GEN  man-Q 
‘Ts (s)he also from Nanmu?' 
b. miin  agdi nin.ut=yee unaadzi=nee? 
SUPERL old boy=Q girl-o 
‘Is your oldest (child) a boy or a girl?’ (Chaoke D. O. 2007: 141, 152) 


In the Shengli dialect, the enclitic has a variant =ni after nasals. This throws some 
doubt on the connection with Dagur but opens up the possibility of a comparison with 
Solon -gi(i). 


(314) Oroqen (Shengli) 
olo-jo pi-xi-n=Ni? 
fish-PART COP-PRS-3SG=Q 
‘Is there any fish?’ (Han Youfeng & Meng Shuxian 1993: 307) 


Content questions do not have the enclitic and are unmarked morphosyntactically as 
in Evenki, and Even. This appears to be a difference to Dagur, but as we will see for the 
Xunke dialect of Oroqen below, the enclitic optionally also marks content questions. 


(315) Oroqen (Chaoyang) 
fii tkon ds;aalin a.laa ama-tfa-j? 
2sc what reason here come-PsT-2sc 


"Why did you come here?' (Hu Zengyi 2001: 148) 


Another enclitic has the form -oo and expresses a certain fear that something has 
happened (Hu Zengyi 2001: 157). 


(316) Oroqen (Chaoyang) 
tari jabo-t[aa-o0? 
3sG go-PST-Q 
‘Is (s)he going?’ (Hu Zengyi 2001: 157) 


Quite clearly, this is a loan from Mongolian =UU that may have acquired a special se- 
mantics in Oroqen. Alternative questions may be marked with the enclitic 2joomaa ~ 
=joomaa that is of Mongolic origin and may combine with a cognate of Evenki =Ku. 
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(317) Oroqen (Chaoyang) 
a. ari bajta tadzaa joomae, olook joomaa? 
this matter real o false Q 
‘Is this matter true or false?’ 
b. ari baja | onaadsr-tfi joomaa.goo, uta-tfi joomaa.guu? 
this person girl-Poss Q boy-POss Q 
‘Does he (this man) have a girl or a boy?’ (Hu Zengyi 2001: 158) 


Apparently, Orogen also has borrowed the Chinese marker ba IE, but sometimes has 


two vowel harmonic variants baa and baa. It has a long vowel as in some varieties of 
Khorchin, Dagur (85.8.2), and Solon (see below). 


(318) Oroqen (Chaoyang) 
ari morin aja | morin baa? 
this horse good horse Q 


‘This horse is a good one, right?’ (Hu Zengyi 2001: 157) 


Orogen has also borrowed the Chinese interrogative disjunction háishi 4 ‘or.Q’ for 
alternative questions. As in Chinese, no additional question marker is present in this 
example. 


(319) Oroqen 
yabu-fa hafi ya-fa? 
walk-pst or what-psT 


‘Did you go or what?’ (Li Fengxiang 2005: 56) 


A slightly different picture can be drawn for the Xunke dialect of Orogen, which has 
a large amount of question markers (Zhang Yanchang, Li Bing, et al. 1989: passim). The 
enclitic =j£ marks polar and, optionally, content questions, which makes a connection 
to Dagur clear. One of their examples given is an alternative question that contains the 
two markers =jə and =ja. These must be vowel harmonic variants of -jr. Thus, the en- 
clitic is even more similar to some subdialects of Dagur that also exhibit vowel harmony 
in this form. Xunke Orogen has likewise borrowed the markers =99 (expressing doubt) 
from Mongolian =UU, and perhaps baa ~ baa from Chinese ba WÉI Alternative questions 
may either be marked twice with one of the two markers ssmal and jooma or may take 
a disjunction aaki that may either stand alone or may be combined with other question 
markers. The origin of 99mal is unclear but possibly may be treated as a variant of jooma. 
Furthermore, there is a tag question marker unti, which looks somewhat similar to the 
negative copula in Solon and Oroqen that developed out of an adjective meaning 'dif- 
ferent’ (Hólzl 2015a). However, in Xunke Orogen, the forms are onto ‘NEG’ and wonto 
‘different’ (Zhang Yanchang, Li Bing, et al. 1989: 183). 


(320) Oroqen (Xunke) 


a. noonin toroki=ja waa-tca aaki gujtcan=ja? 
3sG  boar-o kill-Psr or roe.deer=Q 


‘Did (s)he kill a boar or a roe deer?’ 


292 


5.10 Tungusic 


b. cii iro nana-ni=jE? 
2sc where go-2sc-Q 
"Where are you going?’ 

c. bii umun.na nana-tca-w, unti? 
1sG.NOM once go-PsT-lsc right 


' have been there once, right?’ (Zhang Yanchang, Li Bing, et al. 1989: 123, 
126, 131) 


In Huihe Solon, there is an enclitic =gi(i), which is accompanied by an additional rising 
intonation. It marks polar and alternative questions. 


(321) Solon (Huihe) 
a. eri üxür aya-gii? 
this ox  good-o 
‘Is this ox good?’ (Tsumagari 2009a: 7) 
b. fin-i baj-fi aja=gi, aru=gi? 
2SG.OBL-GEN body-2sc.Poss good-Q bad=Q 
"Are you well (or sick)?’ (Chaoke D. O. 2009: 316) 


Despite functional, formal, and distributional similarities, Solon =gi(i) and Evenki =Ku 
are probably not direct cognates of each other because there is no sound law that would 
justify the different vowel qualities (e.g., Benzing 1956; Doerfer 1978b). Maybe it is a loan 
from a Mongolic language, e.g. Buryat =gii (§5.8.2). Content questions usually do not 
show any marking. 


(322) Solon (Huihe) 
sii iilee tegeji-ndi? 
2sG where live-PRs.2sc 
"Where do you live?' (Tsumagari 2009a: 15) 


Like Oroqen, Solon also has a marker baa with a long vowel that must derive from 
Chinese ba WÉI and a form yeeme that, similar to Khorchin Mongolian jimee, can also 
appear in content questions (§5.8.2). 


(323) Solon (Huihe) 


a. ta.ri üli-see baa? 
3sG go-PST Q 
*He went, didn't he?' 
b. eri si oxon yeeme? 
this cop what Q 


"What is this?' (Tsumagari 2009a: 15) 
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As in Orogen, alternative questions appear to preserve a cognate of Evenki =Ku. Con- 
sider the following negative alternative question. 


(324) Solon (?Huihe) 
fü morin-fi baja=guu, aafin-goo? 
2sc horse-2sG.POss man-Q | NEG-Q 
‘Do you have horses or not?’ (Hu Zengyi & Chaoke D. O. 1986) 


There is limited information on other dialects of Solon, especially the Ongkor dialect 
formerly spoken in Xinjiang. However, there apparently were morphosyntactically un- 
marked questions that probably had a special intonational contour, e.g. si mandii? Are 
you strong?' (Aalto 1979: 11) In addition, there are two forms -ii and -uu, both of which 
are probably loans from Mongolian =(y)ii ~ =(y)UU (85.8.2). In Even, Negidal, and Uilta 
there are markers similar to - ii (see below). Content questions remain unmarked. 


(325) a. Solon (Ongkor) 
ar uktu ulu-r uktu-ii? 
this road walk-PrcP road=Q 
'Is this road the road (usually) travelled?' 
b. baxuu-dze=uu e-dze=uu? 
find-Psr-Q ` NEG-PST-Q 
"Was it found or not?’ 
c. jam isi-ndii? 
which one see-PRS.2SG 


‘What do you see?’ (Aalto 1979: 8, 9, modified transcription) 


The interrogative jam in (325c) is probably a loan from Jurchenic that can also be seen 
in Nonni Solon as jemu (326b, see §5.10.3). There is even less information on the Nonni 
dialect of Solon. Nevertheless, at least some examples have been collected by Ivanovskij 
(1982 [1894])). One dubious example of a negative alternative question apparently relies 
on juxtaposition. Several content questions remained unmarked as well, and an optional 
polar question marker has the form =gi. 


(326) Solon (Nonni) 
a. Sin-i enin ` $amine Zu-de bi-si-n=gi? 
2SG.OBL-GEN mother ?father house-Loc COP-PRS-3=Q 
‘Are your parents at home (still alive)?’ 
b. jemu gerbi-ci? 
which name-2sc.Poss 
"What is your name?' (Ivanovskij 1982 [1894]: 1) 


Except for Oroqen, Negidal is probably the most aberrant Ewenic language with re- 
spect to question marking. At first glance, the situation is similar to Evenki as there is 
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a marker that is cognate with =Ku. Note the absence of the consonant in the form -oo, 
which, like Ongkor Solon -uv, is quite similar to Mongolian. There are also unmarked 
polar questions that probably have an intonation similar to Evenki. 


(327) Negidal 
a. nonan naa.baja-ni=nuu, naa-ni=00? 
3sG | PN-3sG.POSS-Q  PN-3SG.POSS=Q 
‘Is he a Negidal or a Nanai?' 
b. sii saa-s? 
2sc know-2sG 
‘Do you know?’ (Kazama 2002a: 80, 65) 


But Khasanova & Pevnov (2003: 10) mention a morphological marking of questions 
in Negidal as in the following example. Incidentally, the example also contains a further 
question marker =i (cf. example 327a from Even above). 


(328) Negidal 
ii-ja-m=i? 
enter-FUT.Q-1sG.Q=Q 
‘Shall I come in?’ (Kazama 2002a: 127) 


According to them, the interrogative future differs from the general future in two 
points. First, the interrogative future has a short vowel as opposed to the plain future. 
Second, a different personal ending is employed (e.g., 1sG -m instead of -v). Compare the 
following pair of sentences: 


(329) Negidal 
a. eeva iche-a-m? 
what see-FUT.Q-1SG.Q 
"What will I see?’ 
b. o%a-va iche-3ee-v. 
track-acc see-FUT-1sG 
‘I will see the tracks.’ (Khasanova & Pevnov 2003: 10) 


The morphological interrogative marking is found in polar, content, as well as in alter- 
native questions and can combine with interrogative enclitics. Consider the following 
open alternative question with both morphological and enclitic markers. 


(330) Negidal 
mozhet bolotki bi-ja-m-gu ee-ja-m-u? 
maybe autumn be-FUT.Q-1sG=Q what-FUT.Q-1sG=Q 


‘Is it perhaps already autumn or what?’ (Kazama 20022: 114) 
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Previous descriptions of Negidal apparently did not mention this interesting feature 
(see Kazama 2002a: 107, 114, 115). According to Ikegami (1985), in Tungusic languages 
there are generally two different sets of personal endings (Table 5.111). In Negidal, Set 
1 is used after past forms in -čaa as well as future forms in -ja(-naa) and also has a 
possessive function with nouns. Set 2, on the other hand, can be found after present stems 
in -ja or underived stems. Ikegami (1985: 91) also notes that, according to Kolesnikova & 
Konstantinova, the future ending -ja takes the first person inclusive marker -p instead 
of -t. This might indicate a confusion resulting from the interrogative marking and may 
show that Khasanova & Pevnov’s (2003) assumptions are correct. 


Table 5.111: Personal endings in Negidal according to Ikegami (1985: 88f.), from 
Cincius, adjusted 


Set 1 Set 2 
1sG bal -m 
2SG -S -S 
3SG -n -n 
1PL.INCL -t -P 
1PL.EXCL -vun -vun 
2PL -sun -sun 
3PL -tin - 


Accordingly, Set 2 would additionally be used in interrogatives, while Set 1 is found 
in declarative sentences. There does not appear to be any further description of this phe- 
nomenon for Negidal, or for any other Tungusic language for that matter. However, a 
possible areal connection can be found in Yukaghiric (§5.14.2). As in Negidal, the Yuk- 
aghiric interrogative suffixes are restricted to the first person (singular -m, plural -uok ~ 
-ook). But the connection to Yukaghiric is not without its problems. First of all, Yukaghir 
languages are spoken several thousand kilometers north of Negidal and in Yukaghiric 
the special interrogative suffixes are only found in content questions. Furthermore, Yuk- 
aghiric lacks any special interrogative tense markers. But as specified in §2.14 we may 
assume that Yukaghiric was once spoken in a much larger territory and that its speaker 
probably migrated northward along the Lena river from an earlier location close to Lake 
Baikal, which reduces the distance to the Negidal. But even if the areal connection turns 
out to be wrong, we are dealing with an interesting typological parallel in which inter- 
rogative agreement marking is mostly restricted to the first person and the third person 
plural remains unmarked. 

In Udihe polar questions can be marked by intonation only, which is said to be higher 
and somewhat longer than that of declarative sentences (Nikolaeva & Tolskaya 2001: 
807). An element may be moved to a focus position, typically in front of the verb, which 
is different from Evenki as seen above (Nikolaeva & Tolskaya 2001: 841). 
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(331) Udihe 
uti nii monao-ni A bagdoa-mi ` bua-la? 
that man go.PsT-3sc 1sc.NOM born.PsT-1sc place-Loc 


‘Has he (the man) gone to my birthplace’ (Girfanova 2002: 41) 


An alternative is the use of an enclitic =nu ~ =gu, cognate of Evenki -Ku, that attaches 
to the verb in polar questions and to the the element in focus in focus questions. As in 
Evenki, the scope of the marker also encompasses alternative questions. 


(332) Udihe 
a. Iwana zugdi-du  bi-s'e-nu? 
Ivan house-Loc COP-PFV=Q 
‘Is Ivan at home?’ 
b. si bagáü-za-nu bi-s'e-i? 
2sc other.side-N-Q COP-PFV-2SG 
"Were you on the other side of the river?’ 


c. xeleba bie-nu anci-nu? 
bread be.PRs.HAB-Q NEG=Q 


‘Is there bread or not?’ (Nikolaeva & Tolskaya 2001: 809, 812) 
Content questions do not normally take any morphosyntactic marking. 


(333) Udihe 
j'e-le fiansule-i? 
which-Loc study-2sc 
"Where do you study?’ (Nikolaeva & Tolskaya 2001: 801) 


The semantic scope of =nu is thus identical to Evenki, but Udihe has a further en- 
clitic =nA that has a contrastive function. It remains dubious whether this form has any 
connection with Manchu =nA (see below). 


(334) Udihe 
xuda=na? 
fur=Q 


‘And what about the fur?’ (Nikolaeva & Tolskaya 2001: 808) 


The enclitic is also often used together with an interrogative word. Within the follow- 
ing question the contrastive focus lies not on the interrogative but on the river. 


(335) Udihe 
ei-ne j'e.u bdsa-ni? 
this=Q what river-3sc 
‘And what is this river called?’ (Nikolaeva & Tolskaya 2001: 808) 
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Alternative questions may also be formed with -(e)s(i) of unknown origin. In example 
(336a) a content question is followed by an alternative question (§4.4). 


(336) Udihe 
a. j'e-we ` xokto-ni, kdnaa-s ogböö-s? 
what-Acc footstep-3sc.Poss deer-Q elk-Q 
"Whose footsteps are these, a deer's or an elk's?' 
b. iwana sin-du kusige-we bu-ge, —e-si-ni bu-oo-s(i)? 
PN ` 2SG.OBL-DAT knife-Acc give-PFV neg-PST-3sG give-PST-Q 


‘Has Ivan given you the knife or not?’ (Nikolaeva & Tolskaya 2001: 811, 255) 


This latter construction is probably not a tag question construction but an alternative 
question with a question marker on the second alternative only, which is also attested 
for Kilen and Manchu. 

According to Nikolaeva & Tolskaya (2001: 351), there are tag questions that are formed 


with the help of the interrogative j'e.u ^what', which may be attributed to Russian influ- 
ence (cf. 5.5.2.2). 


(337) Udihe 
em e-i, jeu? 
come.PFV-2sG what 


‘You came, didn’t you?’ (Nikolaeva & Tolskaya 2001: 351) 


Udihe and Oroch have a very interesting open alternative question construction in 
which the second alternative is an inflected interrogative verb. This pattern has been 
adopted by Kilen from Udihe. We have already observed a similar construction in Oroqen 
(319), but with the Chinese disjunction instead of double marking. 


(338) Udihe 
su xulisee-u=nu jaa-u=nu? 
2PL go.PST-2PL-Q what.-2PL=Q 


‘Did you go, or what?’ (Nikolaeva & Tolskaya 2001: 811) 


(339) Kilen 
su ana-xai=na ja-xai=na? 
2PL go-PERF-Q what-PERF=Q 
‘Did you go or what?’ (Zhang 2013: 158) 


(340) Oroch 
agduci-za-i-nu jaa-za-i-nu? 
tell-FUT-1sG=Q what-FUT-1sG=Q 
‘Should I tell or what?’ (Tolskaya & Tolskaya 2008: 98, from Avrorin) 
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Example (340) from Oroch shows the interrogative enclitic which, similar to Nanai and 
Udihe, has the form =nu and is optional in polar questions. Content questions remain 
unmarked. 


(341) Oroch 

a. sii Cihala-i-si? 
2SG agree-PRS-2SG 
‘Do you agree?’ 

b. sii jaa-va  xuana-i-si? 
2sc what-acc argue-PRs-2sG 
"What do you argue about?' 

c. sii amba bi-si-nu? 
2sG evil.spirit cop-PRS=Q 
‘Are you an evil spirit (the devil)?’ (Avrorin & Boldyrev 2001: 184) 


In 1958, a team of unknown scientists from China gave a handful of comparative word 
lists for five Tungusic languages in China. Their list also contains two sentences that can 
be translated as ‘when do you come back?’ (4 /E [H|2) and ‘where do you go?’ (NDSSLD 
1958: 82). Unfortunately, they transcribed all languages with the help of Chinese char- 
acters, which makes the analysis less easy. Additionally, some characters were written 
incorrectly. The following gives the corrected sentences in Chinese transcription and its 
rendering in official Pinyin spelling followed by a rough approximation of the original 
languages. The transcription, analysis and glossing is mine. Interestingly enough, the 
set of languages is not completely identical to the five officially recognized languages 
today. There are no sentences from Sibe, but from Hezhen (hézhen if} ££) which refers to 
the dialect of Nanai spoken in China. Hezhen is not very well known (cf. An Jun 1986: 
79-86) and probably extinct by now, while Kilen (qiléng #1##) has been described in sev- 
eral grammatical sketches. The Hezhen data are thus potentially very important. Both 
Hezhen and Kilen are classified together as the Hezhe (hézhé ij) language and are 
treated as dialects by the authors of NDSSLD (1958). Of the five languages only Hezhen 
and Kilen are included here for the sake of brevity. 


(342) Hezhen 
a. WGA AAG [ha li ji zhu yi xi] 
xali dzi-dzu-i-ci? 
when come-REGR-PRS-2SG 


"When do you come back?’ 
b. ZANA [hao xi e nu yi xi] 
xaogi X ena-i-ci? 
where.to go-PRs-2sG 
"Where do you go?' (NDSSLD 1958: 82) 


299 


5 Survey of the grammars of questions in Northeast Asia 


(343) Kilen 


a. PEREKA [a hei? e mo tu yi xi] 
ali | ema-tu-i-ci? 
when come-REGR-PRS-2SG 


"When do you come back?’ 
b. RES ARAN AC [ya le e ne yi xi] 
ya-le ene-i-ci? 
which-ALL go-PRS-2SG 
"Where do you go?’ (NDSSLD 1958: 82) 


In both languages content questions do not take any morphosyntactic marking. As 
will be further explained in §5.10.3, Kilen interrogatives exhibit affinities with Udegheic, 
which explains the absence of the initial consonant in ali, as opposed to Hezhen hali 
‘when’, and the interrogative yale, instead of Hezhen xaosi ‘whither’ (Udihe ali, j'ele, 
Nanai xaali, xaosi).?° Schmidt (1928b: 241) mentions the Samar sentence xajadzi dzidzisi? 
‘Where did you come from?’ Samar is not very well-known, but is clearly very similar 
to Nanai as well eg. Nanai xajaji ‘whence’). 

There are several descriptions of Kilen that differ more or less strongly from each other. 
According to Zhang (2013: 157f.), Kilen expresses polar questions with rising intonation 
on the last word of the sentence, e.g. ci sa? ‘Do you know?’. However, Kilen was heavily 
influenced by Chinese, in fact, Chinese may by now have replaced Kilen completely, leav- 
ing Kilen extinct. Following to Zhang (2013: 158), Kilen borrowed the three interrogative 
particles ba IE, ma "14, and (y)a DA all of which are possible in the sentence above, 
e.g. ci sa=a? ‘Do you really know?’. Most likely, =a is not of Chinese origin, however. 
Several examples of polar questions in Zhang Yanchang, Zhang Xi, et al. (1989) were 
either unmarked (showing rising intonation) or marked with the final question marker 
-a. Note that it never followed anything but the second person singular agreement form 
-ci and was always written attached to it. Nevertheless, it is better analyzed as enclitic 
-a that may appear in both polar and content questions, which might speak instead in 
favor of a connection with Manchu -o. 


(344) Kilen 
a. ci arki-wa omi-mi  bi-ci-a? 
2sc alcohol-Acc drink-cvB CoP-2sG=Q 
"Are you drinking alcohol?' 
b. ia-lo ana-ji-ci=a? 
which-ALL go-PRS-2SG=Q 
‘Where are you going?’ (Zhang Yanchang, Zhang Xi, et al. 1989: 87f.) 


25The character hei $ should instead read li Bi. 

?6Both Hezhen and Kilen show characteristics that suggest a basic connection to Nanai, e.g. the absence of 
an initial consonant in ene- ‘to go’ (Nanai ana-, Udihe nene-, Manchu gene-). The verb ema- ‘to come’ in 
Kilen was most likely borrowed from Udihe (Nanai ji-, Udihe eme-, Manchu ji-). 
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Note the absence of the marker in the otherwise identical sentence (47b) above. An 
Jun (1986) already mentions examples with the Chinese enclitic ba ME. In his data, a is not 


written attached to the preceding word and can also follow forms other than the second 
person singular. 


(345) Kilen 
ci.n-i aga-ci biyan fuli-m ` an-y-ni=a? 
2SG.OBL-GEN e.brother-2sc.Poss wilderness hunt-cvB go-Psr-3sc-Q 
"Did your elder brother go to hunt?' (An Jun 1986: 36) 


In alternative questions the Chinese interrogative disjunctive háishi EB ‘or.Q’ may 
be employed but is combined with the marker =a. 


(346) Kilen 
ci 2i-wa  xogiti-wa ` gala-ji-ci=a? 
2sc this-Acc or.Q that-Acc ?want-PRs-2sG=Q 
‘Do you want this or that?’ (Zhang Yanchang, Zhang Xi, et al. 1989: 45, 
simplified) 


Unlike other alternative question constructions among Tungusic languages, the ques- 
tion marker appears only once and does not attach to the elements in focus. 

A further enclitic called a “contrastive particle" by Zhang (2013: 159) seems to have 
been borrowed from Udihe and marks polar and focus questions. 


(347) Kilen 


a. ci adoqoli-no? 
2sc cold=Q 


‘Are you cold (or not)?’ 


b. sursaca=na talaxa? 


tasty=Q  grilled.fish 
‘Is the grilled fish tasty (or not)?’ (Zhang 2013: 159) 


As seen in example (365) above, it also marks alternative questions. Most likely it has 
been borrowed from Udihe =nu, but it exhibits certain similarities to Udihe =nA as well. 

Nanai is the best described language from the Nanaic branch and there is even a good 
description of question intonation by Baitchura (1979: 294) (underlining removed). 


In general questions, the tone movement in the vowel of the final syllable has a 
clearly and strongly manifested rising character, whereas the mean and the maxi- 
mal tone heights surpass those of the preceding vowels in cases in which no inter- 
rogative particle is present in the sentence. If there is such a particle (e.g., nu), the 
rise of the tone at the end of the sentence is not so high, its pitch being a little lower 
in comparison to the tone heights of vowels at the beginning of the sentence. 
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There are sentences with and without the enclitic. In 1858, Venjukov recorded a polar 
question without enclitic among the Ussuri Nanai. 


(348) Nanai (Ussuri) 
anda, duman bira goró? 
friend PN river far 


‘Friend, is the Duman river far away? (Alonso de la Fuente 2011: 14, from 
Venjukov) 


Similar to Evenki or Udihe, the enclitic =nu in Nanai does not appear in content ques- 
tions (which remain unmarked), but marks more than one question type, including polar, 
alternative, and possibly focus questions. 


(349) Nanai (Najkhin) 
a. swə aja-so=nu? 
2PL good-2PL.POSS=Q 
‘Are you all well?’ 


b. swə sogdata-wa wanda-mari manbo-ci=nu namo-ci=nu ana-i-su? 
2px fish-acc — take-CVB.PL river-ALL-Q sea-ALL=Q  go-PRS-2PL 


‘Do you go to the river or sea to catch fish?’ 
c. si xali ausi ji-ci-si? 

2sc when here come-PsT-2sG 

"When did you come here?' (Ko & Yurn 2011: 155, 68, 52) 
Regarding the last sentence compare example (370) from Hezhen above. 

In Ulcha the enclitic marks focus, alternative, and (optionally) content questions. Such 

an extension of scope can also be observed in Mongolian ($5.8.2). 
(350) Ulcha 


a. loca gasa-noo? 
PN village-o 


‘(Is it) a Russian village?’ 


b. xasu arían-noo, nadan arían-noo hungun aran=noo bi-Ci-ni? 
how.many year-Q seven year-Q six year-Q | COP-PST-?3sG 
'How many years has it been, seven years (or) six years?' (Kazama 2002b: 79, 
86) 

c. saanxai, xai.mi song-i? 

PN why  cry-?pns 


‘Sanghai, why are you crying?’ (Schmidt 1923b: 235f.) 


Within Nanaic, Uilta has the most interesting marking of questions. Polar questions 
in Uilta have both rising intonation and an interrogative clitic =(y)i that might be re- 
lated to the one found in Even, Negidal, and Ongkor Solon, although these were perhaps 
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borrowed from Mongolic. It always follows the verb (Patryk Czerwinski p.c. 2018). In 
addition, there is a specialized marker =ga ~ =ka for content questions that cannot be 
found in any other Tungusic language. 


(351) Uilta (Southern) 
a. eri aya ulaa=yi? 
this good reindeer=Q 
‘Is this a good reindeer?’ 
b. gui sinda-xa-ni=ga? 
who come-PST-3SG=Q 
‘Who came?’ (Tsumagari 2009b: 15) 


Apparently, the marker in content questions is not obligatory as there are also several 
examples without it. 


(352) Uilta (Northern) 
khoni bi-si sii? 
how  coP-Pns2sG 


‘How are you??? (Funk 2000: 150) 


The Uilta polar and content question markers can almost certainly be attributed to 
influence from Amuric (see Sections 3.1 and 5.2.2). Within Ikegami's (1997) dictionary 
there are not only examples with =ga, but also with a marker =gaa. 


(353) Uilta 
tari nari | gui-ga2? 
that person who=Q 
‘Who is that person?’ (Ikegami 1997: 145) 


However, riddles recorded by Ikegami contain yet another variant with a final -k. 


(354) Uilta (Southern) 
xai=gaak? 
what-Q 


"What is (this)?' (Ikegami 1958: 93) 


The origin of the final -k, which can also appear in children's games, remains partly 
unclear and other examples with similar constructions exhibit the marker -ga, instead. 


(355) Uilta (Southern) 
eri xai-ga? 
this what-Q 
"What is this?' (Tsumagari 2009b: 15) 


"Regarding the use of the interrogative, cf. Russian kak dela?/xax nena? 
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Patryk Czerwinski (p.c. 2018) was so kind to check with some of the last speakers of 
the northern dialect. According to his fieldwork, the variant with =gaak is still used as 
an "embellishment" of questions and is marked with respect to the other variants. xai 
tari?, xai=ga tari?, and xai=gaak tari? are said to have more or less the same meaning 
‘What is that?’. 

Nakanome (1928: 21, 50) already mentioned two different forms, the unproblematic 
form «ga» and yet another variant written as «nó» that was probably pronounced with 
a velar nasal [n] and a vowel quality comparable to the form =gaa recorded by Ikegami, 
ie. [go] (see 83.1). 


(356) Uilta 
hai-wó  gade-si-nó? 
what-acc buy-2sc-Q 
"What do you (want to) buy?' (Nakanome 1928: 52) 


In addition, there are variants with a fricative in intervocalic position, e.g. [ui-yo], 
[gui-ye(»)k] "who-Q' (Patryk Czerwinski p.c. 2018). Most likely, we are dealing with one 
enclitic that undergoes both vowel harmonic and consonant alternations depending on 
the preceding syllable (i.e. -KA(A)). In my eyes, Nivkh =na is the most likely source of 
this enclitic in Uilta (see 83.1). 

It is an open question whether =gaak is an independent form or a variant of -KA(A). A 
-k can also appear in answers to riddles and might be a suffix. However, the form =gaak 
apparently exhibits no vowel harmony and only appears in special contexts, which might 
suggest that it is in fact a different form (Patryk Czerwinski p.c. 2018). 

In the northern dialect, the question marker seem to be more strongly fused with the 
preceding elements (-cee < -či ‘3PL’ + -KA ol 


(357) Ultra (Northern) 


xooni to-li-Gee? 
how do-P.FUT-3PL.Q 


‘How are they doing?" (Yamada 2016: 192) 


No examples for alternative, focus, and tag questions have been found in the relevant 
literature (e.g., Ikegami 2002). According to Patryk Czerwinski (p.c. 2018), focus ques- 
tions do not show any difference with respect to polar questions. He elicited the follow- 
ing two alternative questions for me. The analysis roughly follows Tsumagari (2009b). 


(358) Uilta (Northern) 


a. sii xoo-tai ^ nappee-si, oskoola-tai yyuu, duku-takki yyuu? 
2sc which-pir go.Pns-2sc school-pir Q house-DIR.REFL.POSS Q 


"Where are you going, are you going to school or to your house?’ 


28 953 fU CU? in Japanese. For the use of the interrogative, cf. Russian kak dela?/Kax nena? 
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b. sii oskoola-tai nannee-si=yi, duku-takki yyuu? 
2sc school-prR go.PRS-2sG=Q house-DIR.REFL.POSS Q 


‘Are you going to school? Or to your house?‘ 


c. xai-wa  daptu-li-ssee, sundattaa, ulissa3 yyuu? 
what-Acc eat-FUT-2sG.Q fish.Acc meat.acc Q 


"What will you eat, fish or meat?' 


In the second example, there is the polar question marker =(y)i at the verb in the first 
alternative. The second alternative takes what appears to be a question marker yyuu. In 
the first example, because of the ellipsis of the verb, the marker yyuu is found on each 
alternative. In the third example, the marker yyuu only appears on the second and last 
alternative. The preceding content questions exhibits a fused question marker similar 
to the one seen before (perhaps -si + =KA > -see). The only possible question tag in 
Uilta is ii ‘yes‘ (similar to Russian), although this is difficult to identify, given the formal 
resemblance with the polar question marker =yi(i) (Patryk Czerwinski p.c. 2018). 

Question marking similarly aberrant to that in Uilta can be observed in the entire Ju- 
rchenic branch, but especially in Written Manchu. For Manchu, book three of the Qing- 
wen Qimeng (Wuge Shouping & Cheng Mingyuan 1730, translated by Wylie 1855) lists a 
number of interrogative forms: na, ne, no, nu, ya, all of which are probably enclitics. The 
first three must be vowel-harmonic variants of one form =nA, which is similar to Udihe, 
although a connection remains doubtful. The enclitic =nu may be cognate of Evenki =Ku, 
Udihe =nu, and Nanai =nu, but is not often encountered. 


(359) Manchu 

a. waka-ra-raku=na? 
mistake-v-P.IPFV.NEG=Q 
“Will (you) not blame (me) then?’ 

b. gene-raku=ne? 
£0-P.IPFV.NEG=Q 
"Will (you) not go?’ 

c. ojo-raku=no? 
become-P.IPFV.NEG=Q 
"Will it not do?’ 

d. gisu-re-raku=nu? 
word-v-P.IPFV.NEG=Q 
"Will (you) not speak?’ (Wuge Shouping & Cheng Mingyuan 1730; Wylie 
1855: 171) 


The marker =ya is also not very frequent and, might have been borrowed from Chinese 
=(y)a W/E}, e.g. inu=ya? ‘Is it so?’ It appears that =ni and =o are not only the most neutral 
but also the most frequent question markers. There is not much information about the 
two enclitics, but both appear in polar, alternative, and content questions, which makes 
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Manchu different from most other Tungusic languages, but not Oroqen. The enclitic =ni 
has the reduced form n after the existential negator aku. 


(360) Manchu 


a. ere gese baita geli bi-ni? 
this like matter also cop=Q 


"Was there ever anything like this?' 


b. ere ai turgun=ni? 
this what reason=Q 


"What is the reason for this?' 
c. si ere bithe-be  hola-ki | se-mbi-o? 
2sc this book=acc read-oPr say-IPFV=Q 
‘Do you want to read this book?’ 
d. geren niyalma-i dorgi fala-ci aca-ra-ngge we-be-o? 
all person-GEN inside punish-conD meet-1PFv-N who-Acc-Q 
‘Who among all men is to be punished?’ (Wuge Shouping & Cheng 
Mingyuan 1730; Wylie 1855: 131, 133, 140) 
e. si min-de bu-mbi-o — bu-rakü.n? 
2SG 1SG.OBL-DAT give-IPFV=Q give-IPFV.NEG.Q 


‘Are you going to give it to me, or not?’ (Hauer 2007: 67, from the 
Jinpingmei) 


Possibly influenced by akün, the words sain ‘good’, tasan ‘false’, and yargiyan ‘true’ 
have the special interrogative forms saiyün, tasun, and yargiyün. The last example (360e) 
consists of a negative alternative question in which, unlike any other Tungusic language 
except Uilta, two different question markers may be employed. In Manchu, there is a 
wealth of such verb doubling constructions for questions in which the second verb is 
always negated (Table 5.112). Only one of these patterns marks both verbs with a question 
marker and two do not have any marker at all, which may be due to Chinese influence. 
In one case, the second alternative takes two markers. In most cases, there is one marker 
found at the second negated verb (cf. Kilen and Udihe above). 

Given its semantic scope and the possibility that =o alone may mark a negative alter- 
native question, a connection to Kilen -a seems possible. There are further constructions 
not mentioned by Gorelova, which have no reduplication of the verb. 


(361) Manchu 
a. gebu ali.bu-ha-o akün? 
name submit-P.PVF-Q NEG.Q 
‘Have (you) enrolled (for the exam) or not?’ (von Móllendorff 1892: 28) 


b. ere kemuni tolhin waka-o se-me-o? 
this still dream NEG-Q  say-CVB.IPFV-Q 


"Ihis isn't a dream or is it?' (Di Cosmo 2006: 87, 104, 131) 
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Table 5.112: Negative alternative question patterns in Manchu (Gorelova 2002: 
325f.); -mbi ‘IPFV’, -rA ‘P.IPFV’, -hA ‘p.pFv’, -hAku/-raki ‘NEG’, se- ‘to say’, bi- 


‘CoP’ 
V1 V2 AUX 
V-ra V-rakü 
V-ra V-rakü-n 
V-ra V-rakü bi-o 
V-ra V-rakü se-me-o 
V-ra V-rakü-nio 
V-ha V-hakü 
V-ha V-hakü-n 
V-ha V-hakü-n bi-he=o 
V-mbi V-rakü 
V-mbi V-rakü-n 
V-mbi-o V-rakü-n 
V-mbi-he V-mbi-hekü 
V-mbi-he V-mbi-hekü-n 


As opposed to the previous constructions, this last example (361b) has the same ques- 
tion marker used twice, which may be due to the presence of the negative copula waka, 
after which apparently only =o can be found. Aixinjueluo Yingsheng (1987a: 72) argued 
that a Mandarin interrogative construction with sentence-final you ma? GI ‘ex Q’, ap- 
parently found in the Peking dialect, is a calque of Manchu bi-o? 'Ex-Q". 

It is often claimed that the two markers =ni and =o may also be attached behind one 
another to form the complex marker =nio. 


(362) Manchu 
ere sain aku=nio? 
this good NEX=Q 


"Isn't this good?’ (Wuge Shouping & Cheng Mingyuan 1730; Wylie 1855: 134) 


This would be a very unusual pattern among Tungusic languages. But apart from this 
analysis into two question markers, which is a rather unexpected, there is a more plau- 
sible explanation that treats =nio as one marker that was borrowed from Korean (see 
below). Also remember that, following aku, the marker =ni usually takes the form -n. 

In Sibe, polar questions are regularly expressed with the enclitic =na that seems to 
correspond to the Manchu form =nA above but does no exhibit vowel harmony. It marks 
polar and alternative questions. In both polar and content questions, there is sometimes 
an element =ja that might correspond to Dagur =yee. But its status as a question marker 
remains rather dubious. Like many languages in China, Sibe has adopted the Mandarin 


question marker ba DÉI 
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(363) 


a. 


Sibe 


gi guldza-ji nan=na? 

2SG PN-GEN person=Q 

‘Are you from Yining?’ 

gi evsi=ja? 

2sc how=?Q 

‘How are you?’ 

tsahsa avqa-ji arvun evsi? 
yesterday sky-GEN form how 


'How was the weather yesterday?' (Chaoke D. O. 2006: 207, 280, 277) 


ar min dzaq-na,gi.n dzaq-na? 
this Isc.GEN thing=Q 2sG.GEN thing-Q 
'Is this mine or yours?' 


so.n ta.va mori" gum ambu ba? 
2PL.GEN there horse all big o 


'In your place all horses are big, aren't they?' (Zikmundová 2013: 49, 95) 


In general, there are very few descriptions of possible tag questions in Tungusic lan- 
guages. Sibe is somehow exceptional because at least two different tag question patterns 
were recorded. 


(364) 


The first type could be a calque and partial loan from Mandarin dui ma/ba X} Hu 


a. 


Sibe 


ta-s | canbi si.n-i-na 
that-PL pencil 2sG.0BL-GEN-N NEG right=Q 
‘Those aren't your pencils, right?’ 

tər am nan siy, mandzay=na? 

that one man teacher right=Q 


‘That one is a teacher, right?’ 


. si mini-d utsi-v li-maq sinda, 
2sG 1sG.OBL-DAT door-Acc open-cvs.Prv stay[IMP] be-IPFV=Q 


vaq, mandzayn=ba? 


‘Hold the door open for me, will you?’ (Chaoke D. O. 2006: 91, 102, 343) 


E. The 


latter type with the verb o- 'to become, to be, to be permissible' (Norman 2013) possibly is 


a calque of Mandarin kéyi ma PĪ LANS or xing ma 471 


zJ 


in Khorchin Mongolian (§5.8.2). 

Records of Sibe from the beginning of the 20th century that were strongly influenced 
by Written Manchu have been recorded by Muromski. They contain several question 
markers, =na(a), =ńu(u) ~ sell, =ü ~ ="u, and =o (Kałużyński 1977: 53). The marker =U 
might be of Mongolic origin and is the only one that is unknown from Manchu. It appears 


to have fused with the imperfective or dictionary form -mbi of Written Manchu. 
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(365) Sibe 
mi.n-i gala-ci ` tuci-mbü? 
1sG.OBL-GEN hand-ABL come.out-IPFV.Q 


‘Can you escape my hands?’ (Kałużyński 1977: 53) 


Sanjiazi Manchu also shows some of this variation. The following examples were col- 
lected in 1961 and contain the markers -no, -nu, and =ni. The last one seems to be re- 
stricted to content questions that are optionally unmarked, while the other two (=nU) 
appear in polar questions. Enhebatu treats them as variants of the same form. 


(366) Manchu (Sanjiazi) 

a. um baiti-bur ` dondzi-y-no? 
this matter-Acc hear-P.PFv-Q 
‘Did you hear about this matter?’ 

b. urluai — dza?xa-ni? 
this what thing-o 
"What (kind of thing) is this?' 

c. di  gurvvur? 
what name 
"What (is your) name?' (Enhebatu 1995: 300, 296, 55) 


Kim et al. (2008: 45), who did fieldwork in Sanjiazi in 2005 and 2006, recorded the 
markers =no and =na. They claim that the latter is a loan from Mandarin ne VE. Sanjiazi 
has also borrowed Mandarin ba IE. For alternative questions the Chinese disjunction 


háishi iŒ ‘or.Q’ has been adopted. 


(367) Manchu (Sanjiazi) 


a. ci qanqxy-Ey ba? 
2sc be.thirsty-».Prv Q 


"You must be thirsty, right?' (Enhebatu 1995: 55) 
b. najnaj Cicikal-dili ji-xa-ninna haishi xaalbin=dili ji-xa-ninna? 
grandmother PN-ABL  come-P.PFV-N OLQ  PN=ABL come-P.PFV-N 


‘Is your grandmother from Qigihar or from Harbin?’ (Kim et al. 2008: 214) 


Kim et al. (2008: 46) mention an enclitic =ja ~ =ja that they call an “‘intimacy’ particle". 
It may appear in questions but is not restricted to them. A connection to the Sibe and 
Dagur enclitic seems more likely than with Mandarin ya If. 
The Yibuqi dialect of Manchu presents a situation very similar to Sanjiazi Manchu. 
The usual question marker has the form -no, content questions remain unmarked, and 
the Mandarin disjunction may be employed in plain and negative alternative questions. 
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(368) Manchu (Yibuqi) 

a. pi puta tsa-mi=no? 
1sG.NOM food eat-IPFV=Q 
'(Can) I eat the food?’ 

b. cick/o və  tci-ya? 
yesterday who come-P.PFV 
"Who came yesterday?' 

c. ci tci-mi xesl pi kono-mi? 
2SG come-IPFV Or 1SG go-IPFV 


‘Do you come or do I go??? (Zhao Jie 1989: 164, 184, 188) 


The Yibuqi dialect additionally borrowed the Mandarin polar question marker ma 4. 


(369) Manchu (Yibuqi) 
so kam tci-ya mA? 
2PLall come-P.PFV Q 


‘Have you all come? (Zhao Jie 1989: 154) 


Aihui Manchu has the standard polar question marker =no. A form =je similar to 
Sibe is attested, but its meaning is not perfectly clear. Content questions usually remain 
unmarked. Alternative questions take the Mandarin disjunction hdishi 18 ‘or.Q’. 


(370) Manchu (Aihui) 

a. ci mandzo gisun ` bara.na-m-no? 
2sc Manchu language be.able-1prv=Q 
‘Can you speak Manchu?’ 

b. chat — jov-ra? 
what-DIR go-P.PFV 
"Where did (she) go?' 

c. ana gal  meri-m dzi-ya-je? 
why again return-CVB.IPFV come-P.PFV=?Q 
"Why have you come back again?' 


d. a.ra dzaqas| fa-ninna xes] itci-nna? 
this thing cop old-N  or.o new-N 


‘Is this thing old or new?’ (Wang Qingfeng 2005: 210, 229, 228, 243) 


For Aihui Manchu a tag question different from Sibe has been recorded. It may have 
been partly calqued from Mandarin dui bu dui XJ 4*3 or dui ba WHE, 


?? Here, the disjunction may also have the form sl 
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(371) Manchu (Aihui) 
bi aga-dara gam adzigan, ino vaqa=ba? 
1sc.NOM e.brother-ABL all little correct wrong=Q 
'T m smaller than all (my elder) brothers, isn't that right?' (Wang Qingfeng 2005: 
236) 


It may be noted that both inu and waka also function as positive and negative one 
word answers, respectively, in Written Manchu. 

The two languages Bala and Alchuka add important pieces to the puzzle. Both pre- 
serves a cognates of Manchu -o. Compare the following two sentences. 


(372) ?Bala 
cin nianli ai-va-t' bi-2? 
2sG.cEN washing.hammer what-place-Loc COP=Q 


(373) Lalin/Jing Manchu 
ci.n-i nijand; 'a.k'u ai-ba-de bi-x=9? 
2sG.OBL-GEN washing.hammer what-place-Loc COP-PFV=Q 


"Where is your washing hammer?" (Mu Yejun 1987: 25) 


Alchuka, in addition to =9, has a variant -ko with an unaspirated [k]. This form is re- 
lated to Manchu =o as well, as can be observed from a comparison of Alchuka ala-mei=ka 
‘fear-IPFV=Q’ (Mu Yejun 1986: 16) with Manchu gele-mbi=o (Aixinjueluo Yingsheng 1987b: 
15) that were attested in the same sentence. Bala also has a form -7o that is most likely 
cognate with Sanjiazi 2n», Aihui Manchu =no, Yibuqi Manchu =no, and Manchu =nio. 


(374) Bala 
a. ci ana-go? 
2SG go=Q 


"Are you going?' 
b. cin amin-go? 
2sG.GEN father=Q 
‘Is it your father?’ (Mu Yejun 1987: 31) 


Table 5.113 summarizes interrogative markers in Tungusic languages. Kyakala, Ju- 
rchen A, Jurchen B, Kili, and Arman have been excluded for lack of information. To 
the best of my knowledge, the origin of the Jurchenic question markers have never been 
described satisfactorily. But given their presence in Jurchenic, exclusively, and the lack 
of a good internal etymology, a borrowing from a neighboring language seems plausible. 
I argue that most of them (Manchu =o, -n(i), =nio, =nA) were perhaps borrowed from 
Koreanic, which had longstanding contacts with Jurchenic. The details are presented in 


2000 AC RRE in Chinese. Norman (2013) translates Written Manchu niyanca-ká as ‘a wooden stick for beating 
starched clothes while washing’. 
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§5.7.2. Manchu =nu might be inherited from Proto-Tungusic. Aihui Manchu -je, Sanjiazi 
Manchu -jA as well as Sibe -ja may have been borrowed from Dagur. The disjunction in 
Kilen, Orogen, and Manchu dialects was borrowed from Mandarin. 

Regarding the syntactic behavior of interrogatives in content questions, Malchukov 
& Nedjalkov (2010: 343f.) offer the following summary. 


Question formation need not involve WH-movement in Tungusic languages. For 
some languages, WH-fronting seems to be a preferred option, as for example in 
Evenki (Nedjalkov 1997: 7f.). For Even, on the other hand, WH-fronting is associ- 
ated with emphatic/rhetorical questions; in regular constituent questions the inter- 
rogative pronoun remains in situ (Malchukov 2008). In Written Manchu, question 
words also remain in situ (Gorelova 2002: 222). In Udihe (Nikolaeva & Tolskaya 
2001: 799), the position of focused elements including question words is strictly 
before the verb. 


However, note that, according to the description by Girfanova (2002: 42), Udihe be- 
haves like Evenki in putting the question word in sentence initial position. Even Niko- 
laeva & Tolskaya (2001: 799, 805) agree that the interrogative ii-mi/j’e-mi ‘why’ that is 
of converbal origin may optionally stand in clause initial position as well. 


5.10.3 Interrogatives in Tungusic 


Tungusic interrogatives have been treated in some detail before. The classical but partly 
outdated reconstruction can be found in Benzing (1956: 114f.). The most exhaustive lists 
of cognates that nevertheless lack many important data can be found in Cincius (1949: 
264ff.) and Cincius (1975/77). Kazama (2003) elaborates on Cincius (1975/77) and also in- 
cludes data from Kilen and Sibe but still is not exhaustive. Not to be underestimated are 
the data collected in Schmidt (1923a,b; 1928a,b) for Samagir, Samar, Ulcha, Nanai, Oroch, 
Udihe, Negidal, and Evenki. Of these, the first two varieties are almost unknown other- 
wise. Schmidt mentions Samagir ekon ‘what’ and Samar xai ‘what’, which is sufficient 
to classify the two as Ewenic (e.g., Evenki ekun) and Nanaic (e.g., Nanai xar), respec- 
tively (see also Doerfer 1978a). Table 5.114 gives an extended list of cognates for those 
five interrogatives that have the widest distribution among Tungusic languages. For the 
references, see the more detailed descriptions below. The use of Tungusic interrogatives 
or demonstratives as correlatives has recently been investigated in detail by Baek (2016: 
185-226). 

All languages except for some subdialects of Solon and Orogen preserve the interrog- 
ative ‘who’. The form has been reconstructed as “niii (Benzing 1956: 115) or *nui ~ “nei 
(Kazama 2003: 68) for Proto-Tungusic and as "nt for Proto-Ewenic (Janhunen 1991: 70f.). 
Only Kazama’s reconstruction based on Ikegami is erroneous. The original *à regularly 
changed to i in Northern but to u in Southern Tungusic. In some Ewenic languages such 
as Solon or Oroqen as well as Udegheic, the velar nasal changed to an n while it ap- 
parently was lost in all of Jurchenic and Nanaic, except for Uilta and Ulcha. These are 
not regular developments but have certain parallels, e.g. Evenki gina.kin, Solon nini.xin, 
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Language PQ CQ AQ 
Even =Ku# - 2x =Ku# 
Evenki =Ku# - 2x =Ku# 
Evenki (Sakh.) -Kuuf, =too# - 2x =Ku# 
Evenki (Kh.) =Kv# bei# 2x =li# 
Negidal Table 5.111, -Koo£, =i# -, Table 5.111 Table 5.111, 
2x =Kuu# 
Chaoyang Oroqen =YEE#, =oo#, bAA# - 2x =YEE#, 
2x 
jOOmAA# 
(+2x 
=gUU) 
Xunke Orogen =jA#, =o0#, bAA# =jA# (2x =jA +) 
aaki ‘or’, 
2x jooma#, 
2x somal# 
Huihe Solon =gi(i)# (jeeme) 2x =gi(i)# 
Ongkor Solon =uu#, =ii# - 2x =uu# 
Udihe =Ku#, =nA# - 2x =Ku#, 
2x =nA#, 
1-2x 
-(e)s(i)# 
Oroch =nu# - 2x =nu# 
Nanai =nu# - 2x =nu# 
Kilen nə#, =a#, =ma#, =ba# - 2x =no#, 
xoci ‘or’ + 
=a# 
Uilta =(y)i# -KA# =(y)i#, 
yyuu# 
Ulcha =nvv# ?-noof 2x -noof 
Manchu o£, =n(i)#, =nio#, =nA#, =nu# os, =n(i)# Table 5.112 
Aihui Manchu =no# -, ?-jest xeg| or 
Yibuqi Manchu =no#, mA# - xeg| or 
Sanjiazi Manchu =no#, =nu#, ba# =ni#, ?-jA#, -no haishi ‘or’ 
Sibe =na(a)#, ba#, ?-ja# - ?-jos 2x =na(a)# 
Alchuka =(k)o# =n(i)#, ? ? 
Bala =o#, =no# -ott, ? ? 
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Uilta pinda, Nanai enda, Manchu inda.hün, but Udihe in’e.i ‘dog’ (cf. Benzing 1956: 68). 
The short vowel in some northern Tungusic languages must be a secondary innovation 
that is partly shared by the interrogative i(i)-. Kilen ni was borrowed from Udegheic, and 
Kili nii from Ewenic. A form p‘a ‘who’ mentioned by Mu Yejun (1986: 14) for Alchuka is 
most unexpected and cannot be explained with the reconstructed form “niii. Problemati- 
cally, a [p] in Alchuka usually corresponds to an f in Manchu (e.g., Alchuka p'i, Manchu 
fi brush’) and Manchu we clearly corresponds to Nanai ui (e.g., Manchu wesi-hun and 
Nanai uisi ‘up’). It is not very plausible to assume that Nanai ui or Manchu we are not 
related to Uilta nui or Ulcha (ņ)ui. Assuming that the Alchuka form is not a mistake, it 
is most likely related to Manchu we, but details remain obscure for the moment. 


Table 5.114: List of cognates of five Tungusic interrogatives 


Language who what, which which, what how many how 
Proto-Tungusic *püi *ja- *Kai *Kadu *Kooni 
Even i ja-k i-raa-k adi on 
Arman pii jaa-k iää-ra-k aadii oon 
Evenki pi e-kun i-r ady oon 
Negidal nii, mii, jii  ee-xun,ee-kun ii- adii oon(i) 
Solon (Huihe) nii, aawu o-xon ii adi iittü 
Solon (Ongkor) a(y)uu jo-xon i(i) adi 

Orogen (Nanmu) awu i-hun i-r(i) adi ooni 
Orogen (Chaoyang) nii kon i-ri ? oon 

Kha. Evenki B. nii i-kun ii-r adii oon 
Kha. Evenki U. nii ie-kun ii-r adii oon 
Udihe ni(i) jeu ii- adi ono 
Oroch ni jaa-u i- ady oni 

Kili pii ?ii- ~ e- i- adii 

Kilen (ZhY) ni ja - adi əni 
Kilen (ZhP) ni ja - ati əməçi 
Kilen (AJ) ni ja xai xadi, xadu on(n)i 
Kilen (Ling) ui - hai hadu honi-bifi 
Nanai ui - xal xado xooni 
Uilta gui = xai xasu xooni 
Ulcha (y)ui - xai xadu xoon(i) 
Alchuka ?p'o ? kai- kutu katiram 
Bala a(i)- 

Manchu we ya ai udu absi, adarame 
Aihui Manchu va ja D edik avee 
Sanjiazi Manchu wo ja aj udo adolmon 
Yibuqi Manchu va jA ei utu ator(2)mo 
Sibe va ya ai ut afs(e) 


Within the interrogative system of Tungusic “niii has a special position as it is unre- 
lated to the other interrogatives. The same is also true for “ja- ‘what, which’. Benzing's 
(1956) Proto-Tungusic reconstruction *jaa- and Janhunen's (1991: 70f.) Proto-Ewenic re- 
construction *ie- seem to show the wrong vowel quantity and quality, respectively. As 
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for the development of the vowels, note a parallel development in Tungusic “jaa-sa ‘eyes’ 
> Evenki ee.sa, Borzya ii.sa etc. (Benzing 1956: 25; Janhunen 1991: 34). Only Nanaic lan- 
guages have no reflex of *ja-, Kilen being a special case as the interrogative ja has been 
adopted from Udegheic or, less likely, from Jurchenic. Li Linjing (2011: 199) mentions a 
Kilen form ya.o, for which only Udihe j’e.u or perhaps Oroch jaa.u can be the source. 
The extension seen in this form exists only in northern Tungusic. Kili seems to have 
variation between ii- ~ e-, derived from Ewenic. 

The interrogative "Kai (Benzing 1956 reconstructed *xai) is preserved in all branches 
but is absent in some parts of Kilen and exists only in relics in Udegheic. Benzing assumed 
the presence of a suffix attached to a stem “xa-, but no direct evidence for this has been 
found. The initial consonant has been regularly lost in most of northern Tungusic and 
Jurchenic and in most cases changed to a x-like sound in Nanaic. Kazama (2003: 56, 75) 
did not recognize the connection between i(i)- and Nanai xar etc. Admittedly, the stem 
extension (e.g., Evenki i-r ‘which’) can only be found in northern Tungusic. However, 
this must be a secondary innovation of some Ewenic languages that spread from the 
demonstratives (e.g., Evenki e-r ‘this’, ta-r ‘that’). 

There is a certain amount of confusion surrounding the relation of the two stems 
*ja- and *Kai-. For instance, Doerfer (1985: 27) tried to show that they go back to one 
form, but his explanation is extremely speculative and does not appear to be actually 
based on any hard evidence. Nevertheless, the two forms are problematic as they have 
several properties in common, and are partly interchangeable. First, while northern Tun- 
gusic languages have an interrogative verb based on “ja-, the interrogative *Kai- has both 
nominal and verbal properties in Nanaic. This interesting difference can be shown with 
the help of Nanai and Kilen, which has been strongly influenced by Udihe in this regard 
(Table 5.115). 


Table 5.115: Ambiguous interrogative stems in Nanai (Kazama 2007) and Kilen 
(Zhang 2013: 162) 


Nanai Kilen Meaning 
verbal xai-xa-ni ja-xei-ni what-PsT-3sc 
nominal xai-wa ja-wa what-Acc 


Ewenic and Udegheic roughly pattern with Kilen while Jurchenic is close to Nanai but 
apparently is unique in showing an obligatory verbalizer (e.g., Manchu ai-na-, Bala a-na-, 
Alchuka kai-na-). However, Udihe also has an optional derived form j’e-ne-. This split has 
not only been overlooked by Benzing (1956) but also by several other scholars such as 
Tolskaya & Tolskaya (2008: 99). Interestingly, some forms have the same derivation but 
are based on different stems. For example, interrogatives meaning ‘why’ usually have a 
verbal basis and are really converb forms of the interrogative verb, e.g. Even ja-mi, Udihe 
j'e-mi, but Nanai xar-mi and Manchu ai-na-me. Apart from Oroch, Udihe is exceptional as 
it also has the form ii-mi that can be compared with Nanai. Second, the two interrogative 
stems are partly interchangeably in Jurchenic. In Manchu dialects, for example, there is 
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synchronic variation between alternative forms such as ai-erin ~ ya-erin ‘when’ or ai-ba- 
~ ya-ba- ‘where’ without apparent differences in meaning (see below for more examples). 

However, the fact that languages as distantly related and located as Even and Manchu 
have traces of both stems is clear evidence for their existence at a very early stage in the 
development of Tungusic. Furthermore, *Kai is part of a larger group of interrogatives 
that share a resonance in *K~ that most likely is etymologically connected. But even in 
Proto-Tungusic their exact derivation must have already been obscure. For example, Ben- 
zing, based on the assumption of analyzability of *xa-i, reconstructed the interrogative 
meaning ‘how’ as *xaoni, which has to be rejected, as there is no indication of an original 
diphthong. Many modern languages preserve a long vowel, which is why I reconstruct 
the form as *Kooni instead. Janhunen (1991: 70f.) assumed a stem *xoo-, but there is no 
clear evidence that -ni might have been a suffix. This interrogative is preserved every- 
where except for Solon and Jurchenic. In a similar vein, Benzing’s (1956) reconstruction 
*xaduu 'how much' with a long vowel has no real basis as most languages simply have 
a short vowel. Based on the distribution of northern Tungusic i and southern Tungusic 
u, but Nanai o, as well as a comparison with Mongolic (on which see below), the form 
may probably be reconstructed as *Kadu instead. In the latter two interrogatives there 
are some irregular developments such as a progressive vowel assimilation in Udihe ono 
‘how’ (cf. Oroch oni) and a retrogressive assimilation in Jurchenic, e.g. Manchu udu ‘how 
many' (cf. Ulcha xadu). 

Benzing assumed a Proto-Tungusic resonance in *x-. But in my opinion, new evidence 
(e.g., Mu Yejun 1986; Hólzl 2017b) points to a possible reconstruction as plosive (see also 
Rozycki 1993). 'This assumption is based on data from Alchuka that exhibit what could 
be a conservative feature lost in all other Tungusic languages. However, given its un- 
clear phonetic status, for now I use a label K- in the reconstructions instead. The limited 
data from Alchuka contain four interrogatives with an initial unaspirated velar plosive 
k (or perhaps g) that is not present in Manchu (Table 5.116). It has been suggested to 
me by András Róna-Tas (p.c. 2015) that the consonant might be a secondary innovation 
in Alchuka. The initial consonant is attested in about two dozen instances, and it may 
well be a secondary innovation in some of them. However, the fact that it systematically 
appears in many attested interrogatives and has a correspondence in Nanaic x- suggests 
that at least in this position it should be of Proto-Tungusic origin.?! 

From the typological criterion adopted in this study, interrogatives in Alchuka qual- 
ify as K-interrogatives. Regardless of the exact reconstruction that I intend to clarify in 
future studies, Proto-Tungusic clearly has to be classified in the same way. 

Benzing (1956: 114f.) has three more reconstructions (*xalii ‘when’, *xason ‘how much’, 
and *xaba-sikii ‘whither’), all of which exhibit several deficits. Only the last one is at- 
tested in Jurchenic languages. The first two may perhaps be corrected to *Kaali and 
*Kasu(n) (see the description of individual languages below). The last form *xaba-sikii 
poses several problems that cannot be solved easily, but I propose the slightly different 
reconstruction "Ka-bV-si-ki(i) instead (Table 5.117). Apparently we are dealing with a case 
form, more exactly a directive, of an otherwise unknown interrogative starting with K- 


31] am currently preparing a more detailed investigation of the problem. 
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Table 5.116: Selected Alchuka interrogatives (Mu Yejun 1985; 1986; 1987; 1988b,a) 
with Manchu cognates (Norman 2013); inner-Tungusic loanwords are in paren- 


theses 


Alchuka Manchu 
for what reason ( ei) t'uku _ai turgun 
how katiram _adarame 
how many kutu _udu 
to do what kai-na-mei _ai-na-mbi 
what ( ei) _ai 
what has happened gai-na-hanbie _ai-na-habi 
what (is it) kent‘aka _antaka 
when (_ant‘angi)  atanggi 
why (_einu) _ainu 

Table 5.117: Cognates of *Ka-bV-si-kii 

Language Form Meaning 
Even awaskii - awuskii whither 
Negidal awaskii whither 
Evenki awaskii whither 
Oroch (Schmidt) avasee whither 
Kilen (An Jun) xaoci whither 
Ulcha (Schmidt) xavasi whither 
Nanai xaosi whither 
Manchu absi whither, how 
Sibe (Zikmundová) af s(e) whither, how 


that has parallels in the demonstratives, e.g. Even a-wa-ski(i) ‘in this direction, hither’, ta- 
wa-ski(i) ~ ta-wu-ski(i) ‘in that direction, thither’, and a-wa-ski(i) - a-wu-ski(i) ‘in what 
direction, whither’ (Benzing 1955: 77f.; Benzing 1955: 86, 113f.). Manchu preserves the 
forms ebsi ‘hither’ (Alchuka ka’uzi), yabsi ‘how very’, and absi ‘whither’ which has ac- 
quired the meaning ‘how, why’. 

Sibe afs(e) is a regular continuation of Manchu absi. It seems possible that the final 
element was only present in Ewenic but not in the other Tungusic languages. Note that 
there are several case forms that may either stand alone or may be combined with a 
comparable suffix, e.g. Solon dative -dU, ablative -dU-xi. Apart from the case suffix, there 
is another element *-bV, possibly of nominal origin, that might also be present in Proto- 
Tungusic *Ka-bV-gu(u) ‘which’, an interrogative that had not been reconstructed by Ben- 
zing (1956) (Table 5.118). Even aw-gic ‘whither’ could go back to the same source *Ka-bV-. 
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Table 5.118: Cognates of *Ka-bV-gu(u) ‘which one > who’ (cf. Hélzl 2014b); 
Schmidt = Schmidt (1923b), Grube = Grube (1900), Castrén = Castrén (1856) 


Language Form Meaning 
Uilta xaawu which (one) 
Ulcha (Schmidt) xavu which (one) 
?Nanai (Grube) xawui which (one) 
Evenki (Castrén) awguu ~ abguu which (one) 
Evenki avgu which (one) 
Evenki (Khamnigan) abguu which (one) 
Even awug - awag which (one) 
Negidal L avvu, avgu, au, U avgavu which (one) 
Solon (Hailar) ayuu who 

Solon (Huihe) awu who 
Oroqen (Nanmu) awu who 


In Uilta, the initial consonant has been preserved as x-, but intervocalic *VbV and *V gV 
have both been regularly lost (Benzing 1956: 30, 34). The final *uu must have changed to 
wu following the newly formed long vowel aa. Uilta in addition has a special accusative 
form xaakkoo (Tsumagari 2009b: 4, 7f.) which might indicate the presence of an earlier 
consonant other than w since only stems ending in -CV show this type of assimilation of 
the accusative marker -BA and the geminate kk indicates a plosive. This consonant may 
have been a relic of the original *g. My reconstruction is almost identical to Kazama's 
(2003: 68) *xabagu. But the vowel in the second syllable is not entirely certain as it has 
been lost in several languages and shows variation between a ~ u in Even. The inter- 
vocalic "VbV changed to w in northern Tungusic languages. The Even variant awu.n 
indicates that the final *-gu(u) is a suffix that replaces the unstable nasal. In Solon, the *b 
> w was lost and the *g changed to y. After the y had been lost in some Solon dialects, the 
final long vowel must have changed to wu as in Uilta. The second possibility that Solon 
ayuu ~ awu goes back to a form without the suffix *-gu(u) in which the *b changed to 
y is less likely due to the presence of a long vowel that can only be traced back to the 
suffix. Oroqen awu is a Solon loanword. Some points remain unclear, however. For ex- 
ample, does Khamnigan have a b instead of the expected w, because of the following g 
and how does the Upper Amgun Negidal form avgavu fit into the picture? Possibly there 
was a variance between different suffixes, such as in Evenki idy-vu, idy-gu ‘which one’. 
Cincius (1975/77: 4f.) includes Manchu absi ‘how’ in the list of cognates, which is clearly 
a mistake. 

There is one rather problematic interrogative that has several functions and can have 
both verbal and nominal properties. In interrogative sentences the meaning is extremely 
broad as it may be translated as ‘who’, ‘what’, ‘which’, ‘where’ or ‘how many’ (Bulatova & 
Grenoble 1999: 24). Given its unclear semantics it has been glossed as INT (interrogative). 
Consider the following example from Evenki. 
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(375) Evenki 
agii anii-Ba arii-Za-ra-n? 
INT INT-ACC INT-IPFV-PRS-3SG 
"Who is doing what?' (Bulatova & Grenoble 1999: 26) 


Problematically, the word may also be used in declarative sentences where it may “re- 
place nearly any verb" (Bulatova & Grenoble 1999: 26) or may also function as a demon- 
strative. Given that cognates from the Nanaic branch do not show an initial consonant, 
this word is clearly of a different origin than the other interrogatives. The best treatment 
of this unusual word has been given by Idiatov (2007: 301ff.). Elaborating on Cincius 
(1975/77), he gives the following account. The word started out as a noun meaning some- 
thing like ‘thing’, which in Evenki may have been combined with the genitive or the 
alienable possession marker. The second step was the development of a “placeholder or 
filler", such as English whatchamacallit (Idiatov 2007: 302). This function is attested in 
several other Tungusic languages. The last step was from a placeholder to an interroga- 
tive. Since the last function is restricted to Evenki, the forms from other languages will 
not be treated here any further. 

Tungusic interrogatives exhibit several striking similarities to Mongolic that cannot 
be explained by chance (Table 5.119). These comparisons do not stand on their own but 
join well-known similarities in the personal pronouns and demonstratives. 


Table 5.119: Similar interrogatives in Mongolic and Tungusic 


Mongolic Tungusic 
what *ya-xu/n *ja- (northern + *-ku/n) 
to do what *ya-xa- *ja- (only northern and Kilen) 
how many/much *kedii- *Kadu 
when *keli *Kaali 


This is not the place to present a discussion of a possible genetic connection between 
Mongolic and Tungusic, but it should be pointed out that language contact could also 
account for these similarities (see §5.8.3). Most likely, the forms have been borrowed 
by Tungusic because the morphology involved is also known from other elements in 
Mongolic, such as demonstratives, e.g. “e.li, *te.li or *e.dü-, *te.dü- (Janhunen 2003d: 20). 
The first two from the list also appear to have been borrowed by Nivkh (§5.2.3) 

The following will address interrogatives in the individual branches of Tungusic in 
turn. Table 5.120 gives some interrogatives from Arman and Even. Even has some unique 
developments in interrogative paradigms (Table 5.121). While the stem extension of the 
interrogative i-, an extension from the demonstratives, is shared by most Ewenic lan- 
guages, Even i-raa-k exhibits a further innovation. The final -k stems from the interroga- 
tive ja-k and even found its way into the demonstratives and may tentatively be analyzed 
as a newly formed nominative marker that is restricted to these four stems. 
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Table 5.120: Interrogatives in different dialects of Arman and Even (Doerfer 
& Knüppel 2013, modified; Benzing 1955; Sotavalta 1978: 12, passim, modified; 
Schiefner 1874); DK = Doerfer & Knüppel, B = Benzing, S = Sotavalta, Sch = 
Schiefner; case forms and several alternatives are not shown 


Arman (DK) Even (B) Western Even (S) Anadyr (Sch) 


who nii, pii ri ii ri 
what jaa-k ja-k je-k ja-k 
which iaa-ra-k i-raa-k i-ra-k i- 

to do what jaa- ja- ie- (i)ja- 
why (cvs) jaa-mi, jaa-ngai ja-mi ie-gaji (i)ja-m(i) 
which one awug 

where i-lee, iàá-laa i-loo i-là, i-la, i-DDw i-la 
how oon, uun on letat, ‘ekat on 
when ook ok ok 
how much/many  aadii, adal adi adi 

how much/big asun asun ahun ~ ahumn 


Table 5.121: Nominative and accusative case forms of interrogatives and demon- 
stratives in Even (Benzing 1955: 77, 79) 


who what which this that 
NOM pi ja-k i-rəə-k ə-rəə-k ta-ra-k 
ACC Di-w jaw i-ro-w 29-ro-w ta-ra-w 


The Evenki and Negidal interrogative systems are extremely similar to one another. 
Generally, the forms tend to be a bit longer than those in Even. In Khamnigan Evenki, 
while the Urulyungui dialect preserves a small difference between ie- and ii-, the two 
interrogatives "ja- and "Kai- completely coalesced into i(i)- in the Borzya dialect. In gen- 
eral, Khamnigan Evenki interrogatives appear to be more closely related to Oroqen than 
to Evenki. Apart from this partly shared sound change, both groups have also changed 
the initial velar nasal to an alveolar nasal in nii ‘who’ and have a form aali ‘when’ instead 
of ookin in Evenki. But apart from awu ‘who’, which is borrowed from Solon, Oroqen 
does not have a cogante of abguu ^which one'. The Khamnigan form iir-giiji ^ iir-giid 
has a cognate in Orogen iri-gida and Evenki ir-git. Evenki dialects, such as the one from 
Sakhalin, exhibit a very similar interrogative system but shows some regular phonolog- 
ical differences (e.g., axun ‘how many’, Bulatova & Cotrozzi 2004; Atknine 1997). 

There are descriptions for several Oroqen dialects, the interrogatives of which are 
given in Table 5.123. Only a selection of case forms is included. The complex forms ixun- 
tfaalin or tkon d3aalm ‘why’ and adi erin-du ‘when’ contain the Manchu loanwords 
jalin ‘reason’ and erin ‘time’. The second part in iri-gotfin ‘what kind of’ is probably not 
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Table 5.122: Evenki (Nedjalkov 1997: 3-18, 135-136, 214-216, 318ff.), Negidal (Cin- 
cius 1982: 34, passim), Khamnigan Evenki (Janhunen 1991: 70f.), and Aoluguya 
Evenki interrogatives (Hasibate'er 2016: 171, 238); U - Upper Amgun, L - Lower 
Amgun dialect of Negidal. B - Borzya, U - Urulyungui dialect of Khamnigan 


Evenki 


Meaning 


who 
what 


what kind of 
(-dy 'ApJ) 
what kind of 
(PL) 

(-ma ap) 
why 

(-da ‘-cvB’) 
how much 
how many 
times 

by what 
number 
(-tAl ‘QUANT’) 
which 

what, which 


how many 
which, what 
kind 
whither 
what, which 


which 


where 


which one 
how much/ 
many 
when 


how 


Evenki 
ni 
e-kun 


e(e)-ku-dy 


e(e)-ku-ma 


ee-da 


ady 
adyra 


ady-tal 


anty 
ani 


asun 
avady 
avgu 


i-r 
i-du 


idyvu, idygu 
oki 


ookin 


oon 


Negidal 
nii, nii, pii 
L ee-xun, 
U ee-kun 


ee-daa(j) 


adii 
adiijaa 


adiital 


an, ani, ane 
‘yes, alright, 
okay’ 
asun 


awaskii 
Lavvu, avgu, 
au, U avgavu 
ii 

ii-duu, ii-laa 


L ooxin, 
U ookin 
oon(i) 


Khamnigan 
nii 

B i-kun, 

U ie-kun 


adii 


abguu 

ii-dvv, B i-lee, 
U ii-lee 

ooki 

aali 


oon 


Aoluguya 


ni, nr 
&:-qon, ae:-koon 


adi, addi 


irogecin, irga:tfin 


r-ra, ig 

i:-du, r-la, r-ra 
irba, rca: 
aqin, bein 


æ:xa 
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Manchu hacin ‘kind, sort, class, item’ (from Korean) because there is a similar suffix in 
other Ewenic languages (Benzing 1956: 100), e.g. Evenki -gAchin ‘similar to, just as, like’ 
(Nedjalkov 1997: 56), e.g. Aoluguya Evenki iragecin ~ irga:tfin. The suffix -du is a locative 
and dative case marker that can also be found in oki-du ‘when’ (based on oki ‘how much’, 
influenced by Solon), ixu-tu ‘when’ (based on i-xun ‘what’ with stem extension) and (i)i- 
tu ‘where’ (based on i-(xun) ‘what’ without stem extension). The etymology of idzirgee 
‘which one’ remains unclear for now. The Nanmu forms awu ‘who’, oonde ‘what kind of’, 
and joonde ‘why’ have been adopted from Solon. The same is probably true for iktu ‘how’ 
as mentioned by Chaoke. The origin of jeE-ma ‘which one’ remains unclear, but a con- 
nection to Mongolic seems plausible. One can observe a slow phonological convergence 
of the two different stems i-hun ‘what’ and i-r(i) ‘which’. 


Table 5.123: Interrogatives in different dialects of Oroqen (Hu Zengyi 2001: 101, 
148, 261; 1986: 94; Chaoke D. O. 2007: 47, 257f.; Zhang Yanchang, Li Bing, et al. 
1989: 56, 141, 184; Han Youfeng & Meng Shuxian 1993: 43, 264-265; CK = Chaoke 
D. O. 2014a: 164-165, passim) 


Chaoyang Wulubutie Nanmu Xunke Shengli CK 


who nii nii awu nii nii ni 
what Ikon Ikon ihun ikon ixun ikun 
when aali(1) aal aala aali aali, oki-du, 
ixu-tu adi 
erin-du 
how many adi ati adi 
what kind of iri-getfin, iri-gatfin, oondi iri-gotein iri- 
iggetfin, ipetfin, kotf'in, 
in- in- igNetf'in 
nuwoon nuwoon 
which i-ri i-ri i-r(i) (i)i-ri i-ri 
which one jEEma jeman 
which one (PL?)  idzirgee idzirgee idzirga itfarkee 
where i-loo, i-du i-lə i-rə i-ləə, iitu i-le(-ni/ 
ni) 
whither iri-gida iri-gida iri-gida iri-gida iri-kita, 
lir-to 
why 1da, (kan iida, iima, ixun- 
dzaalin joonde tfaalin 
how gon oon ooni oon (w)oon oon, iktu 
how much ooki oohi əəki ER (o)oki 


Solon interrogatives are probably the most aberrant among Ewenic languages. The 
interrogative ni(i) ‘who’ has been almost completely replaced (see Table 5.124). The un- 
expected vowel quality in (j)o-xon ‘what’ can possibly be attributed to influence from 
Dagur (yoo(n) ‘what’). ohi-du ‘when’ similar to Evenki is based on ohi ‘how much’, but 
contains an additional locative marker. This form has been adopted by one Orogen di- 
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Table 5.124: Interrogatives in different dialects of Solon (Chaoke D. O. 2009: 
35f., 250ff., 351£., 355; Tsumagari 2009a; Poppe 1931: 110); CK = Chaoke, T = 
Tsumagari, P = Poppe, R = Ramstedt (Aalto 1976; 1977, modified), K = Kamimaki 
(Lie Lie 1978: 175, 177, modified); case forms are not listed 


Huihe Huihe(T)  Hailar (P) ` Ongkor Ongkor 
(Ck) (R) (K) 
who awu, ni aawu, nii ayuu auu-ayuu  ahu 
what o-hog ii, o-xon ii i(i), o-hon, 
jo-xon u-hun 
when ohi-du ooxi-du ooxii-du aali oke-du 
what kind of ondi oondii oondii oondin onde 
which iggu ii-r, iggü iixuru 
where i-o ii-lee ii-lao ii-lo i-e, i-lo 
how ittu iittü iitturu 
why ida, joodog yoodon iidaa, ida, yoda 
joodaa joxon-du 
how many adi adi adii adi aade 
how much ohi ooxi OOxii, ooki ooke, 
oondii one 


alect while Ongkor Solon aali ‘when’ in turn can perhaps be traced back to influence 
from Oroqen. Dagur yoondaa ‘how, why’ is perhaps the source of joodaa ‘why’. 

The origin of iggu ‘which’ and ittu ‘how’ is unclear, but in Solon a geminate suggests 
the earlier presence of a consonant cluster as can be seen in many examples, e.g. Evenki 
irgi, Solon iggi ‘tail’. Possibly, the forms are based on the stem i(i)-r(i), followed by a 
case ending. At least synchronically the form with the suffix -r(i) has no wide distribu- 
tion among Solon dialects, which usually employ the bare stem i(i). But further evidence 
for this view can be gleaned from the demonstratives e-ri ‘this’ and ta-ri ‘that’ that still 
have the extensions, and the derived forms ettü ‘in this way’ and tattü ‘in that way’ 
(Tsumagari 2009a: 3, 6). Problematically, from a synchronic perspective no case marker 
has the expected form *-gii or *-tü. At least the latter may have a connection with the 
dative -du ~ -dü that in Evenki also has a variant -tu with an unvoiced consonant. Ram- 
stedt’s Ongkor Solon materials have been recorded in Tacheng (Lie 1978: 128). From the 
city Alimtu, Muromskij collected several unproblematic forms including au ‘who’, ad’ ~ 
adi ‘how many’, ile ‘whither’, ida ‘why’, on(i) ‘how’ (Lie 1978: passim, Katuzynski 1971: 
passim). Chaoke D. O. et al. (2014: 63) mention a form antie ‘how’ that seems to corre- 
spond to Evenki anty ‘which’. 

Kili is a mixture of Nanaic with Ewenic elements (Doerfer 1978a), but judging from 
the interrogatives alone, Kili appears to have stronger affinities to Ewenic than to Nanaic 
(e.g., pii ‘who’, e-ma ‘what’, adii ‘how many’, ali ‘when’, i-du ‘where’, ii-daj ‘why’, osi 
‘which one’, Kazama 2003: passim; see Sunik 1958 for details). Note the characteristic 
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form nii as well as the absence of the initial consonant x-. Kili osi remains obscure but 
has a cognate in Kilen oci. The two interrogatives iidaj and ema are also characteristic of 
Ewenic, but might stem from two different sources as indicated by the different length 
and quality of the initial vowel. 

As expected, interrogatives in Udegheic languages show affinities with Ewenic. Ta- 
ble 5.125 gives an overview of some forms attested in Udihe and Oroch. 


Table 5.125: Udihe (Nikolaeva & Tolskaya 2001: 348ff.; Tolskaya & Tolskaya 
2008: 100) and Oroch interrogatives (Avrorin & Boldyrev 1978; Lopatin 1957, 
collected in 1924, modified); not all variants are listed 


Udihe Oroch 

NT AB Lopatin 
who ni(i) ni nvi 
what je-u jaCo) ya(-u) 
what kind of je-me 
why je(-ne)-mi, ii-mi jee-mi, jd-mi, jà-na-mi, ji-mi 
what, where, why jefe jàvo yava 
when ali ali ale 
how many adi adi ady 
how ono oni oni 
which onobui 6n’i bi 


Nikolaeva & Tolskaya (2001: 348) claim that j’e-fe ‘what, where, on which place’ is 
an accusative form. It seems, however, that this form rather corresponds to Manchu ya- 
ba and Sibe ya-va ‘where, which place’. The strange looking form onobui is probably a 
contraction of ono ‘how’ with an inflected form of the copula bi-, which has a parallel in 
Kilen. Apart from this, several more Udihe interrogatives have been adopted by Kilen as 
well (see below). 

The interrogative ni(i) ‘who’ is declined as the word nii (or ninta) ‘man’ but does not 
have an etymological connection to it, as claimed by Schulze (2007). Instead, the forms 
correspond to Nanai ui and nai, respectively, and are similar only by chance. But one 
cannot exclude the possibility of a folk etymological connection. The interrogative j'eu 
exhibits some irregularities. Apart from the nominative forms, the paradigms are parallel 
to the demonstratives (Table 5.126). The ending -u in j'e-u or jaa-u ‘what’ is identical in 
origin with Evenki -kun in e-kun. In Oroch, but not in Udihe, this extension is also found 
in most case forms. This is a secondary leveling that has a parallel in Evenki and Oroqen. 
Udihe j’euxi ‘whither’ probably corresponds to Nanai xaosi but is based on a different 
stem. As in Nanai the same suffix -uxi can otherwise only be found in the demonstratives. 

In Udihe, forms such as the dative j’e-du or the locative j’e-le have the variants ii-du and 
ii-le. According to Nikolaeva & Tolskaya (2001: 349), this only represents a difference in 
pronunciation. However, ii- really is the relic of a different stem of which no nominative 
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Table 5.126: Interrogative and demonstrative paradigms in Udihe (Nikolaeva & 
Tolskaya 2001: 100, 343f., 348) and Oroch (Avrorin & Boldyrev 2001: 193, 197) 


Udihe Oroch 
who what this that who what this that 
NOM ni jeu ee, ei ute, uti, ` Du jaa-u ei tei, ti 
tee, tii 
ACC  ni-we (ewe  a-wa (u)ta-wa | n'iive ` jaa-va ee-ve X taa-va 
DAT  nidu  je-du  o-du (u)ta-du — n'iidu  jaa-u-du  ee-du  taa-du 
LAT ni-tigi —j'e-uxi a-uxi,i-tigi  (u)ta-uxi, n'ii-jijü jaa-u-jiji eet ^ taa-jiji 
(u)ta-tigi 
LOC ni-le je-le o-lo (u)ta-la n'ii-le jaa-u-la ee-le taa-la 
PROL nii je-li o-li (u)ta-li n'ii-li jaa-u-li ee-li taa-li 
ABL ni-digi je-digi o-digi (u)ta-zi n'iidui  jaa-u-dui ee-dui  taa-dui 
INST — nizi je-zi o-zi (u)ta-digi oun jaa-u-ji ee-ji taa-ji 


Table 5.127: Selected case forms of two different interrogatives in Udihe, Even, 


and Manchu 
Udihe Even Manchu 
NOM jeu : ja-k i-raa-k ya ai 
DAT je-du ii-du ja-du i-du ya-de ai-de 
LOC je-le ii-le ja-la i-laa - - 
CVB jeCne)mi  ii-mi ja-mi - - ai-na-me 


or citation form is left in Udihe (Table 5.127). Oroch likewise has these alternative forms, 
e.g. i-du (Schmidt 19282). 

In Manchu the locative *-/A is only preserved in relics (e.g. ama-la ‘behind’) and the 
stem extension is restricted to the demonstratives e-re 'this' and te-re 'that'. Strangely, 
Udihe also shows this variation between two stems in the interrogative ii-mi ~ j'e-mi 
‘why’. Given that these are converb forms, Udegheic is the only branch in which both 
stems can function as interrogative verbs. Udihe ii-mi directly compares with Nanai xar- 
mi and j'e-mi with Even ja-mi. Udihe furthermore has the variant j'e-ne-mi, which is 
similar to Manchu ai-na-me, but is based on the other stem. 

Unlike all northern Tungusic languages and most of Jurchenic, Nanaic interrogatives 
form a coherent system in which all forms share the resonance x-. The only exceptions 
is the interrogative meaning ‘who’ that was already different in Proto-Tungusic, as well 
as some isolated Uilta forms that have perhaps been borrowed from Nivkh (sado, saa, 
nuulu) (see 85.2.3). Patryk Czerwinski (p.c. 2018) elicited the forms sadu and emphatic 
sadoo from a northern Uilta speaker. 
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Table 5.128: Interrogatives in Najkhin Nanai (Kazama 2007), Ussuri Nanai (Sem 
1976), Ulcha (Majewicz 2011), and Uilta (Ikegami (1997); Tsumagari (2009b); Ma- 
jewicz (2011)); accents removed 


N. Nanai U. Nanai Ulcha Uilta (M) Uilta (TI) 

who ui ui nui puj gui 
what xai yay xaj xaj xai 
todo what  xai- xay Cra)- xaj- xai- 
why xai-mi xajo xai-mi 
when xaali xal’y xali xali xaali 
which one xamacaa yamatsa xamata xaawu 
what for xaigoji xaonk’y, xajd’u xajbu 

xaonk’y, 

xaonk’yd’y 
where xai-do xay-do xaj-du, sado ` xai-du 
whither xaosi xaos (y), xajban xavasaj, xoty,  xawasai 

Xaos y, nuulu 

yay-xaydu 
whence xajaji xajdani xavedu xamaččuu 
how xooni xon’i xon(i) xooni xooni 
how many xado xado, xado xasu, xadum  xasu xasu 


With respect to the other Nanaic languages, Kilen exhibits a very different set of in- 
terrogatives (Table 5.129). Not only is there a rather confusing variation in the origin of 
the individual forms, but different accounts show striking differences as well. The forms 
that most closely resemble Nanai have been collected by Ling Chunsheng (1934) and they 
might actually represent Hezhen instead of Kilen. All other descriptions show variations 
between some forms of Nanaic and some of Udegheic or Jurchenic origin that have been 
borrowed. yadi, if this is not a typo, is an especially interesting form as it combines fea- 
tures typical of southern and northern Tungusic. It shares the initial consonant typical 
for Nanaic, but has a final -i that can only be of northern Tungusic origin. Udihe loans 
include ni, adi, oni, and maybe several more such as ja, uki, and onibici, although the lat- 
ter has also been recorded with an initial consonant atypical of Udihe. Manchu elements 
include the nouns jaka ‘thing’ (in ia-ma-dzaka) and perhaps erin ‘time’ (in ia-ma-arin, 
adi/ ya erin-du, and iarin). The interrogative ja is certainly of Udihe origin, because Li 
Linjing (2011: 199) mentions a form ya-o ‘what’ that can only stem from Udihe j’e-u but 
not Manchu ya. The interrogative iatin ‘when’ appears to be a combination of ja and 
perhaps an otherwise unknown noun meaning ‘time’ or suffix that can also be found 
in Manchu atanggi ‘when’, see below. The forms onnomi and onaqami are obscure but 
may contain the converb marker -mi. As seen before, NDSSLD (1958: 82) mentions the 
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Table 5.129: Interrogatives in different descriptions of Kilen (Ling Chunsheng 
1934: 243, 245; An Jun 1986: 38, 63; Zhang Yanchang, Li Bing, et al. 1989: 40, 
44f., 70, 74f., 88, 144; Zhang 2013: 95, 162f.; Chaoke D. O. 2014b: 164 et passim); 
the table also contains all available case forms. L - Ling Chunsheng, AJ - An 
Jun, ZZD = Zhang et al., Z = Zhang, CK = Chaoke; forms from An Jun (1984) 
in square brackets 


CK 


ZZD 


who 
what, which 
what 
what (thing) 


which 

where 

where from 
where to 

when, what time 


how 


how many 


how much 


why 


hai 


honibifi 


hadu 


AJ 

ni ni 

ja ia 

xai, [xajo] 
ia-mo- 
dzaka 

EISEN dcimkan 
ja/ia-du 

xaoci ia-lo 
ja-ma-orin 

ombico, oni, onibici 

on(n)i, ‘what’ 

onnomi, 

[onbi¢a] 

xadi ~ adi 

xadu, [adi] 

onaqami unakomi 


dei 

ja-tu 
ja.tu-tiki 
ia-tu-la 
jarin 


omaci 


ati 


uki 
oynnomi, 


ni 
ya 
hay 


ya-ma 


iri 
i-lo, i-du 


jotin, ya 
erin-du, 
adi 
erin-(du/ 
ni) 

oni, ikti 


adi 


uki 


omomi 


two Kilen forms ya-le DRÉI and ali [iy B 


E32 Kilen ocimkan is a contraction of oci and 


the numeral amkan ‘one’, which is most likely of Jurchenic origin (Manchu emken). The 
interrogative oci has a cognate in Kili osi and ikti ‘how’ perhaps in Oroqen iktu ‘how’. 
Both remain unclear etymologically. 

Interrogatives in Jurchenic show marked differences from the other Tungusic lan- 
guages. Almost no information is available for Bala and the few interrogatives available 
for Alchuka have already been given in Table 5.116. Sibe and Written Manchu, on the 
other hand, are exceptionally well described. Table 5.130 gives an overview of interroga- 
tives in Manchuic languages. Aihui Manchu edzaxa ~ edzay, edik, according to Enhebatu 


32The character Œ was incorrectly written as ¥ 


Kusi 
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(1995: 149), has a cognate in Sanjiazi Manchu aizurg, aizig, aizyi ‘how much, many’, but 
remains obscure. Sibe yask(a) might be comparable as well, but seems to be based on ya 
instead of ai. The Manchu form adarame (Alchuka katiram) has never been analyzed in a 
clear manner. There are several possibilities, but the most likely scenario is a derivation 
from the interrogative ai that subsequently lost the i as in other forms. If the final -me is 
the imperfective converb form as a comparison with Nanai xar-mi ‘why’ might suggest, 
then at least one of the other elements present might have been a verbalization. Both -dA 
and -rA are attested in this usage, but their combination would be most unusual. Prob- 
lematically, the verbal interrogative in Manchu has the regular form ai-na- (Bala a-na-, 
Alchuka kai-na-). In fact, Manchu also has the expected form ai-na-me ‘how’. Perhaps 
the form has to be analyzed as *a(i)-da-ra-me with an unclear derivation of the interrog- 
ative stem. The forms ainu and antaka (Alchuka kent'aka) are even more obscure but 
probably derive from *Kai-, too. 

Unfortunately, there is almost no record of Jurchen A interrogatives. However, there 
is a form that has been reconstructed as *wanon fii} ‘how’. Kiyose (1977: 137) hypothe- 
sized that it might be connected to Manchu antaka, but this is not very convincing. At 
first glance, no similar looking form is attested in any other Jurchenic language. The only 
tentative solution that I can think of, apart from an altogether unknown interrogative or 
a mistake, is to compare "wanon with Manchu ai-na-me, which has the same meaning 
and shows some remote formal resemblance. The initial *w- is extremely problematic but 
could perhaps be a reflex of an initial consonant, cf. Alchuka kai-na-. The interrogative 
ai lost the i in some other instances as well, cf. Bala a-na-. What has been reconstructed 
as *-n might thus be a converb form that does not, however, match Manchu -me or Bala 
-mi (Mu Yejun 1987: 30). The converb has been recorded in the form -n in the modern Ai- 
hui dialect, but such a comparison would be anachronistic. Nevertheless, a converb form 
would be expected because the following word was a verb. Kiyose (1977: 140) assumes 
that *ain ZS DI may be the same interrogative as Manchu ainu, which seems accurate. 
Kiyose (1977: 144) also mentions a form adi 9 DU that has been translated as ‘etc. This 
might correspond to Manchu udu ‘how many/much’, but, if true, is closer to northern 
Tungusic. Notice that Manchu udu may also mean ‘several’, which is a bit closer seman- 
tically. 

It is well-known that Manchu has a very special and aberrant position among Tungu- 
sic languages. In fact, the differences are so strong that I have previously put forward 
the possibility that, in Operstein's (2015) terminology, Manchu is a contact variety that 
shows a certain amount of simplification (Hólzl 2012). An additional argument in favor 
of this hypothesis is the existence of many analyzable interrogatives that consist of ei- 
ther ai- or ya- together with a noun or, in some cases, another element (Table 5.131). 
Most forms are not normally written in one word in Manchu, but may be analyzed 
as compounds. Most dialectal forms that go back to these compounds have not been 
listed above. Most of these formations are very transparent. In some cases there is an 
unexpected semantics, such as in ai-se-me ‘why’. Hauer (2007) additionally mentions 
the form ainam.baha- ‘to get how’, which is a combination ainame ‘how’ and baha- ‘to 
get’. It is highly doubtful that bi- in ai-bi- is the copula bi as claimed by Gorelova (2002: 
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Table 5.130: Interrogatives in Manchuic (Norman 2013; Zikmundova 2013; 
Wang Qingfeng 2005; Kim et al. 2008; Zhao Jie 1989); most case forms and 
some variants are not listed 


Written Sibe Aihui Sanjiazi Yibuqi 
who we va va wa va 
what ai ai € aj ei 
to do what aina- aino- 
which ya ya ja jA 
when atanggi, aitin, ya etin(ye), ajtin, aj ei inka 
ai erinde erin-t e(j)irin alin 
what kind of thing  ai-jaka, ya ai jaq(ə) atsaga, aj jaka ei tgAxya 
jaka edere 

what (is it) antaka antq 

which one yaemken, yam, ja(m)kon, j^ omk'o 
yaka yamko? jama 

where aiba-, £.Vi-, £ba-, jawu- ~ jApA- 
aibi-, yava-, java jawa(-) 

yaba-, ye-, ai- 

ya-, ai- 
how absi, af$(e) avee ~ adalman atar(a)ma 

adarame, avce 

ainame 
why ainu, ai a? an - ar, ai tulxun anA 

turgun ana ~ ano 
how many udu ut, udele, ude, udali utu 

yask(ə) £dzoxo ~ 
edzəy, 
edik 


219) because it would be impossible to attach a case marker to it, e.g. ai-bi-de ‘where. 
Most likely it really is a variant of ba ‘place’, e.g. ai-ba-de. Manchu ai-ba- has interest- 
ing correspondences in the demonstratives. While the demonstratives have the usual 
form e-(re) ‘this’ and te-(re) ‘that’, the correspondence of ai-ba- is u-ba- ‘here’ and tu-ba- 
‘there’ A special case is represented by atanggi, which seems to be based on ai ‘what’. 
The form may be amalgamated, but, based on the meaning ‘when’, one may suspect a 
noun meaning ‘time’ to underlie the second part. In fact, ai-erin- is such a compound. 
But there is no word meaning ‘time’ with an adequate form in Manchu and atanggi is 
not synchronically analyzable. Table 5.132 shows all dialectal cognates available. 

The Alchuka and Bala forms might indicate a connection with Manchu antaka. How- 
ever, according to Mu Yejun (1988b,a), the -n- in one form of Bala and in Alchuka is an 
innovation. The Kilen form iatin has most likely been borrowed but is probably built 
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Table 5.131: Analytical interrogatives in Manchu (Hauer 2007; Norman 2013; 


Holzl 2015c) 


Form Meaning Basis Meaning 
ai-ba-, ya-ba- where ba place 

ai-erin- when erin time 

ya-emu, ya-emken which one emu~emken one 

ai-hacin, ya-hacin what kind of hacin kind, sort, class 
ai-haran why haran reason 

ya-ici whither ici direction 
ai-jaka, ya-jaka what sort of jaka thing 
ai-niyalma, ya-niyalma what person, who  niyalma person 

ai-se- to say what se- say- 

ai-se-me why se-me Say-CVB.IPFV > QUOT 
ai-turgun why turgun reason 

atanggi when ? ?time 


Table 5.132: Cognates of the Jurchenic temporal interrogative; not all variants 


are shown 


Language Form 

Manchu atanggi 

Aihui Manchu etin(ne), etinni 
Sanjiazi Manchu aitin, aitinga, aitinne 
Sibe aitin 

Yibuqi Manchu ei t'inko 

Bala a(n)t’annia, at'ani 
Alchuka ant'aggi 

Kilen ?iatin 


on Manchu ya instead of ai. Given the absence of a fitting word meaning ‘time’ within 


Jurchenic, it may have been borrowed from another language, not unlike Manchu hacin 
‘sort’, which according to Benzing (1956: 100), stems from Korean. Middle Korean enu-cjej 
‘when’ matches the Jurchenic form typologically (‘what-time’), but not formally (§5.7.3). 
A connection to (Eastern) Old Japanese tókyi 55 [x ‘time’ (Kupchik 2011: 60, 106) seems 
extremely implausible, but this is the only form I was able to find in surrounding lan- 


guages that at least looks remotely similar. The interesting proposal by Alonso de la 


Fuente (2017) that atanggi might be related to other words ending in +nggi such as senggi 
‘blood’ is not plausible on semantic grounds and would leave the first part ata- unana- 


lyzed. For now, the origin of the interrogative remains obscure. 
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5.11.1 Classification of Turkic 


The internal diversity of Turkic is much more elaborate than that of, say, Mongolic, 
but less so than Uralic. Based on Johanson (1998: 81f.; 2006a: 161f.) the languages may 
roughly be classified as in Figure 5.7. An asterisk indicates that a given language is at 
least partly spoken in NEA today. Of course, Turkic languages altogether derive from 
southern Siberia and northern Mongolia (e.g., Yunusbayev et al. 2015), which is why 
languages such as Turkish and Chuvash will also briefly be addressed. Most languages 
included here are from the Northeastern or Siberian branch. The Turkic language Yellow 
Uyghur that is also called Western Yughur (xībù yùgù yù Fou #4 [li in Chinese) or 
Sarig Yughur, has to be distinguished from the Mongolic language Shira Yughur or East- 
ern Yughur (dongbà yügà vg RASA AIF in Chinese, §5.8). Yellow Uyghur has no close 
relation to Uyghur, which belongs to an altogether different branch of Turkic. Despite 
its name, Fuyu Kyrgyz, spoken in the Heilongjiang province of northeastern China, is 
more closely related to Yellow Uyghur and the other Abakan Turkic languages than to 
Kyrgyz as such, which belongs to the Kipchak branch. Similar to the Tungusic language 
Sibe that was partly relocated to Xinjiang in 1764, Fuyu was brought to Manchuria from 
the Altai region in the 1750s under emperor Qianlong (see Hu Zhenhua 1986; 1996; Hu 
Zhenhua & Imart 1987; Janhunen 1996). 

Eynu (dini X 5f in Chinese) can be considered a truly mixed language. It is “a lan- 
guage that is structurally and grammatically Uyghur, but whose vocabulary is predom- 
inantly Persian or Persian-derived.” (Lee-Smith 19962: 861). Possibly, the origin of Eynu 
lies in its use as a secret language. Because this study is mostly concerned with the gram- 
mar of questions, it has been classified as basically Turkic here (see also Tooru et al. 1999 
for a discussion). The alternative name Abdal has a derogatory meaning and will not be 
used (see also Ladstátter & Tietze 1994, Wurm 1997). Another special case is Salar, which 
is the result of language contact between Turkic languages from different branches. 


Salar originated as an Oghuz language and during the course of its speakers' grad- 
ual eastward migration acquired various influences from southeastern- and north- 
western-type Turkic languages as well as from non-Turkic ones. It had been as- 
sumed earlier that Salar is an isolated dialect of Modern Uyghur, mostly on the 
basis of phonological features such as liquid assimilation and vowel raising. (Hahn 
1998: 400) 


Ili Turki (tá érké T.S. in Chinese) is a language that is not very well-known and for 
which only a few descriptions are available. According to Hahn's (1991: 31) classification 
it shares properties with both the Kipchak and the Uygur-Karluk branches of Turkic, but 
it has been tentatively classified with the latter in this study. 
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5.11.2 Question marking in Turkic 


The following will briefly describe question marking in the Oghuz language Turkish, 
which is actually located outside of the Northeast Asian area but may serve as a ref- 
erence point for the other Turkic languages. Turkish has a sentence-final particle =mI 
that is usually written detached from its host but is best analyzed as enclitic. It has the 
vowel harmonic variants listed in Table 5.133. There is no variation in the consonant that 
invariably has the nasal shape m. 


Table 5.133: Vowel harmonic forms and distribution of the Turkish question 
marker (Góksel & Kerslake 2005: 22, 251; Landmann 2009: 4, 24) 


Preceding vowel Variant of mI 
i, e (front unrounded vowel) mi 
ü, 6 (front rounded vowel) mü 
1, a (back unrounded vowel) mi 
u, o (back rounded vowel) mu 


The morphosyntactic behavior of the Turkish question marker is similar to Proto- 
Tungusic and some modern Tungusic languages, Middle Mongol or Old Japanese. It is 
a mobile particle that also marks focus and alternative questions and is also part of sev- 
eral tag question markers. In focus questions it attaches to the element in focus, which 
receives an an additional peak in intonation (Landmann 2009: 24). Content questions 
remain unmarked. Alternative questions take two markers and have an optional dis- 
junction yoksa. 


(376) Turkish 
a. Nermin okul-a git-mig-mi? 
PN school-DAT go-PFV.EV=Q 
‘Has Nermin gone to school?’ 


b. Nermin okul-a=mi ` git-mis? 


PN school-DAT=Q go-PFV.EV 
‘Has Nermin gone to school?’ 

c. Cemal okul-a git-ti-mi, (yoksa) git-me-de-mi? 
PN  J school-paT go-Prv-Q (or) gO-NEG-PFV-Q 


‘Did Cemal go to school or not?’ 
d. Ev-e ne zaman gid-ecek-sin? 
home-pAT what time go-FUT-2sG 
"When will you be going home?’ (Góksel & Kerslake 2005: 257, 254, 259) 
Turkish has tag questions marked with the demonstrative óyle ‘like this’ or the nega- 


tive copula degil followed by the regular polar question marker mi (Góksel & Kerslake 
2005: 253). 
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(377) Turkish 
a. Cemal bugiin okul-a git-me-di, degil mi? 
PN today school-DAT go-NEG-PFV NEG Q 
‘Cemal didn't go to school today, did(n’t) he?’ 


b. Esra Handan-in abla-si-ymis, óyle mi? 
PN PN-GEN ` elder.sister-3SG.POSS-EV.COP so Q 


‘So Esra is Handan's elder sister, right?’ (Góksel & Kerslake 2005: 253) 


Similar tag question markers derived from demonstratives and negative copulas are also 
known from Mongolic (§5.8.2). With this reference point in mind we can now address 
those Turkic languages spoken in Northeast Asia. 

In Salar, the only Oghuz language located in Northeast Asia, polar questions appear 
to obligatorily take one of several sentence-final question markers. One marker has the 
vowel harmonic variants mu, mo, and mi and is most likely cognate with with Turkish 
ml. Another marker has the two forms u and o and can be compared with the Mongo- 
lian polar question marker =(y)UU that has the two vowel harmonic forms -(y)uu and 
-(y)oo. It exhibits a short vowel in some, especially Shirongolic languages that are in 
close vicinity to Salar. The functional difference between the two markers =mU and =U 
remains unclear, but they appear to be mutually exchangeable (Lin Lianyun 1985: 91). 
The vowel harmonic forms of both particles have a somewhat unclear distribution. For 
example, all three variants, mu, mo, and mi, are apparently possible in example (3782). 
The fact that there are different numbers of vowel harmonic variants can be explained 
by the different origin of the question markers. Both have been reanalyzed as enclitics 
here. 


(378) Salar 
a. sen af if-dgi-mu? 
2sc noodle drink-PsT.DEF=Q 
‘Have you eaten (yet)?’ 
b. u ge(l)-mif=u? 
3SG come-PST.INDEF=Q 
‘Has (s)he come?’*? (Lin Lianyun 1985: 90, 91) 
Salar content questions are almost always marked with the sentence-final -i, which is 
likely to be related to Tuvan JL Dukhan JL Tofa -(u)l, Yakut -(n)rj, and Dolgan -ij, all of 


which are restricted to content questions (see below). The forms have all been analyzed 
as suffixes here, but some might have an enclitic status. 


(379) Salar 


33The meaning of the parentheses in this last example is not entirely clear, but could indicate either optional 
elements or parts of the verb stem that are lost in combination with the suffixes. 
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ana, sen cala  va(r)-wur-i? 
girl 2sc whither go-FuT-Q 


‘Miss, where are you going?’ (Lin Lianyun 1985: 116) 


However, in one example Lin Lianyun (1985: 82) has an example with a copula deri 
that possibly contains the question marker, though this was left unanalyzed. The marker 
-iis also absent after verbs with a definite past marking (Lin Lianyun 1985: 71). 


(380) Salar 
sen nag-a ` vulo gel-d3i? 
2sc what-paT for come-PST.DEF 


"Why have you come?' (Lin Lianyun 1985: 86) 


Furthermore, Salar has a question suffix -du ~ -do that is directly attached to a verb 
stem and is said to be connected to the category of evidentiality. It is only used if one 
has observed oneself that the addressee has finished a certain action. 


(381) Salar 
sen if-du? 
2sc drink-Q.SELF.EV 
‘(I see that) you have finished drinking?’ (Lin Lianyun 1985: 71) 


The very specific meaning as well as the morphosyntactic behavior make it implau- 
sible to assume a connection with the Yakut and Dolgan question marker =duo ~ =duu 
that we will encounter further below. Like many other languages spoken in China, Salar 


has borrowed the Mandarin question marker ba HE. 


(382) Salar 
u si-(niyi) gaga-n ira ba? 
3sc 2sc-(GEN) e.brother-?2sc.Poss COP Q 
‘This is your elder brother, right?’ (Lin Lianyun 1985: 84) 


Tatar, Kazakh, and Kyrgyz are the three Kipchak languages spoken in Northeast Asia. 
I will address them in turn. Tatar as spoken in China has a cognate of the question marker 
in Turkish and Salar that has the form =ma ~ =mi (Chen Zongzhen & Yi Liqian 1986: 30). 
It has likewise been reanalyzed as enclitic here. As opposed to Salar, however, content 
questions remain unmarked. 


(383) Tatar (China) 
a. sin qajsa-sa-n ala-san? 
2sG which-3sc.Poss-Acc want-2sG 
"Which one do you want?' 
b. ismenil abzij tyrkije-ge bar-ma-ran-mo? 
PN uncle PN-DAT  go-NEG-PTCP.PST-Q 


‘Has uncle Ismeril (younger brother of the father) never been to Turkey?’ 
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c. siz ilgeri yalaq  bartfasa-&a bar-wan i-di-giz=mi? 
2sG.POL before people park-DAT go-PTCP.PST COP-PST-2SG.POL=Q 
‘Have you been to the People's Park before?’ Chen Zongzhen & Yi Ligian 
1986: 147, 111) 


The situation is very similar to Tatar as spoken in Russia, for which there is more infor- 
mation on intonation: In polar questions the question marker is optional and “the pitch 
is high on the last accented syllable of the clause” (Poppe 1963: 126). Content questions 
remain unmarked and have either falling intonation with two elements (384b) or first 
rising and then falling intonation with more elements (384c). 


(384) Tatar (Russia) 


a. zükiyá kil-de-me? 
PN | come-PST-Q 


‘Did Zakiya come?’ 
b. närsä bar? 
what coP 
"What is it?’ 
c. kicü sez-gü kem kil-de? 


yesterday 2PL-DAT who come-PsT 


"Who came to you yesterday?' (Poppe 1963: 126) 


As compared with the other Turkic languages mentioned in this chapter, questions 
in Kazakh are exceptionally well described (e.g., Geng Shimin & Li Zengxiang 1985; 
Muhamedowa 2016: 17-24). However, only some aspects of question marking in Kazakh 
can be included here. For further information, the interested reader is referred to the 
specialized description of Kazakh interrogative constructions by Zhang Dingjing (1991). 

Polar questions are either marked by rising intonation or a particle. Rising intonation 
has an additional semantic component of surprise, e.g. ol oqow/2? ‘Is (s)he a student?’ 
(Zhang Dingjing 1991: 99). The question particle =MA is often written detached from its 
host but probably has the status of an enclitic. But the enclitic is not mobile and thus 
cannot mark focus questions as in Turkish (Muhamedowa 2016: 17). It marks polar and 
alternative questions. Alternative questions may take an optional disjunction ælde. 


(385) Kazakh (China) 


a. akan da bar-do-ma? 
e.brother also go-psT=Q 


"Did your elder brother go as well?' (Geng Shimin & Li Zengxiang 1985: 119) 


b. ol oga.wfa=ma, oqo.t.2wf2-ma? 
3sc student=Q ` teacher-o 


‘Is (s)he a student or a teacher?’ 
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c. olar faqar-ma-da=ma ælde bar-xan ` kel-me-di-me? 
3PL call-NEG-PST-Q  orQ go-PsT.PFV come-NEG-PST-Q 
"Didn't they call you or did you (simply) not come?' (Zhang Dingjing 1991: 
99, 104) 


According to Muhamedowa (2016: 65) standard Kazakh älde is primarily used in the 
written language and is restricted to questions. Standard disjunction is expressed with 
the help of nemese ‘or’. This is a distinction also found in Mandarin Chinese háishi 3^ 
ze (interrogative) versus hudzhé 9X (standard) (§5.9.2.1). In Chinese Kazakh, negative 
alternative questions either take two markers and an optional disjunction (386a) or, if 
the second alternative consists of the negative existential exclusively, only the first al- 


ternative is marked (386b). In this case, the second alternative simply consists of the 
negator dog, This may have been influenced by Uyghur as there are examples with 
question markers from Kazakhstan (387b). Spoken Kazakh loses its agreement markers 
if the question marker is present. The sentence kel-e-di=me? 'come-PRs-3sc-Q' in written 
Kazakh thus has the spoken equivalent kel-e=me? (Muhamedowa 2016: 168). The same 
phenomenon can be observed in the following alternative question. 


(386) Kazakh (China) 
a. ol kel-e=me, kel-me-j=me? 
3SG come-PRS-Q come-NEG-PRS-Q 
‘Does (s)he come or not?’ 
b. sara-maj-da dzaqsə kgr-e-sin=be | d3oq? 
yellow-oil-DAT good  see-PRs-2sG-Q NEG 
‘Do you like butter?’ (Zhang Dingjing 1991: 104) 


(387) Kazakh (Kazakhstan) 
a. ornija orir-a-sin=ba, orir-ma-y-sin=ba? 
place sit-pRs-2sG=Q sit-NEG-PRS-2sG=Q 
"Are you going to sit down at your place or not?’ 
b. ornija orir-a-sin=ba, Zoq-pa? 
place sit-pRs-2sG=Q NEG=Q 


‘Are you going to sit down at your place or not?’ (Muhamedowa 2016: 18) 


In sentences with second person singular agreement forms, the particle =MA often 
has the appearance of a suffix -MI that precedes the agreement suffix. But apparently 
both constructions are usually possible in such cases. 


(388) Kazakh (China) 


a. nurbek-sin=ba? 
PN-2SG=Q 
"Are you Nurbek?’ 
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b. nurbek-pi-sin? 
PN-Q-2SG 
‘Are you Nurbek?’ (Geng Shimin & Li Zengxiang 1985: 120) 


The enclitic and the suffix both follow vowel harmony and depend on the preceding 
consonant, but have different realizations (Tables 5.134, 5.135, see also Kirchner 1998a: 
328). 


Table 5.134: Realizations of the Kazakh enclitic =MA (Geng Shimin & Li Zengx- 
iang 1985: 119, passim, Zhang Dingjing 1991: passim, and Muhamedowa 2016: 


17) 
after back vowels after front vowels 
after r, j, V =ma =me 
after voiced C =ba =be 
otherwise =pa =pe 


Table 5.135: Realizations of the Kazakh suffix -MI preceding second person 
agreement forms (Geng Shimin & Li Zengxiang 1985: 119, passim; Muhame- 
dowa 2016: 17) 


after back vowels after front vowels 
after r, j, V -mə -mi 
after voiced C -bə -bi 
otherwise -pe -pi 


In spoken but not written Kazakh there is a tendency for the enclitic to lose the vowel 
harmony in favor of the back vowel variants ma, ba, pa (Muhamedowa 2016: 17). Content 
questions do not have a particle or suffix but exhibit rising intonation (Muhamedowa 
2016: 20). 


(389) Kazakh (China) 
siz qajda bar-a-siz? 
2sG.POL where go-PRS-2sG.POL 
"Where do you go?' (Geng Shimin & Li Zengxiang 1985: 68) 
A marker specialized to inquire about topics that is restricted to the spoken language 
takes the form -/I (i.e. fa ~ fi) or =/e (Zhang Dingjing 1991: 103, Muhamedowa 2016: 19). 


(390) Kazakh (China) 
omar=fe? 
PN=Q 
"What about Omar?’ (Geng Shimin & Li Zengxiang 1985: 121) 
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A marker with a function similar to tag question markers found in content questions 
takes the form & and is thus similar to the marker a in Kalmyk in both form and function 


(§5.8.2). 


(391) Kazakh (China) 
muna sagan kim ajta-p — ber-di, ae? 
this 2sc who tell-cvs give-PsT Q 


"Who told you that, eh?' (Geng Shimin & Li Zengxiang 1985: 131) 


In this function æ is accompanied by rising intonation and has the specific meaning “that 
the speaker remains baffled by the whole situation of a (certain) circumstance even after 
giving it much thought” (E zr Vti AUGE SRI RA C9 ELI HS Zhang Dingjing 
1991: 103). Muhamedowa (2016: 19) simply treats standard Kazakh ä as a tag question 
marker and translates it as ‘right?’. It is accompanied with rising intonation. 

The question marker in Kyrgyz has the form -BI (Kirchner 1998b: 346). It takes the 
form =b] after z, m, n, n, l, r, w, and j but the form -pl after any of the following sounds: 
p, t, k, s, $ č and x (Kara 2003: 38). As opposed to Kazakh, there is no variant with an 
initial nasal. It can attach to variable word classes, which is why it has been analyzed as 
an enclitic as in Kazakh. 


(392) Kyrgyz (Kyrgyzstan) 
al kel-e-bi? 
3sG come-PRS-Q 


‘Is (s)he coming?’ (Kara 2003: 39) 


Basically the same situation as in Kyrgyz proper can be observed in Kyrgyz as spo- 
ken in China. Here the enclitic has four (as opposed to two in Kazakh) different vowel 
harmonic variants: =Ba, =Bi, =Bu, and =By (Hu Zhenhua 1986: 155). 


(393) Kyrgyz (China) 
bul kitep=pi? 
this book=Q 
‘Is this a book?’ (Hu Zhenhua 1979: 89) 


In the Teskei dialect of Kyrgyz spoken in Xinjiang (tiesikai SE Bir 2. in Chinese) the 
question and agreement markers have the reversed order compared to Kyrgyz proper, 
but only in the second person. 


(394) Kyrgyz 
kel-e-sing-bü? 
go-PRS-28G-Q 


"Are you going?' 
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(395) Kyrgyz (Teskei) 
kel-e-bi-sin? 
gO-PRS-Q-2SG 
‘Are you going?’ (Makelaike Yumai'erbai 1986: 23) 


This unusual morphosyntactic alternation suggests influence from Kazakh or Uyghur. 
No examples for focus or alternative questions were found in the literature available to 
me. Content questions are unmarked. 


(396) Kyrgyz (China) 
qaijda bar-a-san? 
whither go-PRs-2sG 
"Where are you going?’ (Hu Zhenhua 1986: 174) 


Apart from the question marker mentioned above, there is a whole range of additional 
constructions with fine semantic differences. Topic questions take the marker -t/I (i.e. 


tfa ~ =tfi ~ =tfu ~ -tfy), a cognate of Kazakh -/I ~ =/e. 


(397) Kyrgyz (China) 
men bar-a-man, siz=tfi? 
ISG go-PRS-1sG 2sG=Q 
‘I have to go, what about you?’ (Hu Zhenhua 1986: 155) 


Kyrgyz has a marker beken that is a contraction of the polar question marker with a 
particle eken. Similarly, there is a question marker bejim that derives from a combination 
of the question marker with dejim. 


(398) Kyrgyz (China) 


a. al qor&az beken? 
3sG PN Q 


‘Isn’t he Kyrgyz?’ 
b. al tatar bejim? 
3sGPN Q 


‘(S)he apparently is Tatar, right?’ (Hu Zhenhua 1986: 155, 156) 


The exact meaning of these forms remains unclear to me. 

There are two Uyghur-Karluk languages spoken in Northeast Asia as defined here, 
Uyghur and Uzbek. Uzbek is located for the most part outside of the region but is also 
spoken by a minority in Xinjiang, which is why it has been included here. In addition, 
there is the Uyghur-Persian mixed language Eynu that is included in this chapter because 
of the apparent similarities in question marking to Uyghur. Question marking in Uyghur 
is rather complex. Fortunately we are in possession of descriptions of Uyghur from the 
19th century. According to the description by Shaw (1878: 56), Uyghur has a marker 
=mu. Similarly to Kazakh there is a split that in Uyghur depends on the tense affix. The 
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default position of =mu is the very end of the sentence. However, if the non-past marker 
is present, the question marker has the shorter form -m and precedes the agreement 
marker (Shaw 1878: 56). 


(399) Uyghur 
a. qel-ding-mu? 
do-PsT=Q 
‘Did you do?’ 
b. qel-a-m-san? 
do-NPST-Q-2sG 
‘Do you do?’ (Shaw 1878: 56) 


In fact, Shaw's explanation is more likely than the one by Tuohuti Litifu (2012: 366) 
and Abdurehim (2014: 178), who claim that in non-past sentences the marker takes the 
form -am ~ -dm. Most likely, the element -a ~ -á is a variant of the non-past marker and 
only -m is the question marker. 


(400) Uyghur 
Gta kel-ál-à-m-siz? 
tomorrow come-ABIL-NPST-Q-2SG.POL 


‘Can you come tomorrow?’ (Tuohuti Litifu 2012: 366) 
But the descriptions agree that the question marker usually has the invariable form =mu. 


(401) Uyghur 
sün tapsuruq-ni išlä-p bol-dun=mu? 
2sc homework-Acc make-cvB aux-PST.2SG=Q 
‘Have you finished your homework?’ (Tuohuti Litifu 2012: 219) 


Shaw (1878: 56) states that, colloquially, =mu may have the form =ma. But more likely 
the form =ma is a combination of =mu and another marker =a that expresses astonish- 
ment (Tuohuti Litifu 2012: 366). 


(402) Uyghur 
bayiqi  gáp-ni  anli-mi-din=ma? 
just.now talk-Acc hear-NEG-PST=Q 
‘Didn’t you hear what was just said?’ (Tuohuti Litifu 2012: 367) 


However, the Lopnor dialect of Uyghur does not show the fusion with the tense 
marker “when the second person or plural suffixes are attached” and also has a vari- 
ant mi, e.g. yürü-y-mi-siz “go-NPST-Q-2sG.POL’ (Abdurehim 2014: 178f.). But the author 
clearly contradicts himself and also gives the following example: sat-a-m-sán 'sell-NPsT- 
Q-2sG (Abdurehim 2014: 208). Polar questions in Uyghur can also be formed by rising 
intonation alone (Abdurehim 2014: 208). Content questions remain unmarked. 
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(403) Uyghur 


a. nima bar? 

what exist 

‘What is there?’ (Shaw 1878: 81) 
b. bu nema? 

this what 


"What is this?' (Tuohuti Litifu 2012: 367) 


Alternative questions have an optional disjunction yaki and two obligatory question 
markers. Most languages with two question markers have the identical question marker 
on the respective alternatives. The identical part of the two alternatives that is prone to 
ellipsis usually remains unmarked. Such a situation can also be found in Uyghur (see 
example 406a below). However, there is a typologically very special situation in Uyghur 
in which the first of the two markers attaches to the identical element of the two alter- 
natives and has a form different from the second marker. 


(404) Uyghur 
bügün kel-a-m-siz (yaki) áti-mu? 
today come-NPST-Q-2SG.POL Or tomorrow=Q 


‘Do you come today or tomorrow?’ (Tuohuti Litifu 2012: 321) 


A similar construction from Kazakhstan has been mentioned by Muhamedowa (2016: 
18) but was left unexplained as the Uyghur pattern. In this example, the first question 
marker retains its shape, but see above on Kazakh for cases in which the agreement 
marker follows the question marker as in Uyghur. 


(405) Kazakh (Kazakhstan) 
samsung al-a-sin=ba, ayfon=ba? 
PN buy-PRs-2sG-Q PN=Q 


"Will you buy a Samsung or an iPhone?’ (Muhamedowa 2016: 18) 


The form of negative alternative questions depends on the clause type that determines 
the type of negator. 


(406) Uyghur 
a. sdn zadi bar-a-m-sán, bar-ma-m-sün? 
2sc actually go-PRs-Q-2sG go-NEG-Q-2sG 
"Will you actually go or not?' 
b. saga jan kerük-mu áàmás-mu? 
2sG.DAT life need=Q — NEG-Q 


‘Do you want to live?’ 
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c. ba$qi-lar-din al-yan qárz-ni qaytur-al-a-m-sán yoq? 
other-PL-ABL take-p.pFv debt-Acc pay-ABIL-Q-286 NEG 


‘Can you pay the debt you have from the others or not?’ (Tuohuti Litifu 
2012: 368) 


Notice the absence ofthe second question marker in the last example with the negative 
existential yoq, a situation already encountered in Chinese Kazakh. There are several 
additional sentence-final question markers, the meaning of which is not absolutely clear, 
e.g. yu, du (Abdurehim 2014: 208). But há is clearly a question tag, and ču must be cognate 
with the topic question marker from Kyrgyz and Kazakh seen above. 

There are only a few descriptions of the mixed language Eynu. But the same question 
marker =mu as in Uyghur is present. Consider the following pair of negative alternative 
questions in Eynu and Uyghur. 


(407) Eynu 
xani-da | mike hes-mu, nist-mu? 
house-Loc goat COP-Q | NEG-Q 


(408) Uyghur 
ój-de ófke bar-mu joq(=mu)? 
house-Loc goat coP-Q  NEG(-Q) 
"Are there any goats in the house or not?' (Wurm 1997: 245) 


Lee-Smith (19962: 860), who has the same example in a slightly different transliteration, 
does not have the second question marker in Uyghur. As can be seen, the grammatical 
structure of the Eynu example is nearly identical to Uyghur. However, the lexical stems 
have a Persian origin. According to Zhao Xiangru & Aximu (2011: 315), there is only one 
question marker in Uyghur but two in Eynu. Content questions in Eynu are unmarked 
as in Uyghur. 


(409) Eynu 
ma jek tfefme kes kim? 
this one eye person who 
(410) Uyghur 
ma jektfefme kifi kim? 
this one.eyed person who 
‘Who is this one-eyed person?’ (Zhao Xiangru & Aximu 1981: 46) 


In this example Eynu kes is of Turkic origin while Uyghur has the same Persian loan- 
word for ‘one-eyed’ as Eynu. Content questions in Eynu remain unmarked. 


(411) Eynu 
siz nid3ej-din fies bol? 
2sG.?POL where-ABL come.PsT.2sG 
‘Where did you come from?’ (Tooru et al. 1999: 31) 
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Uyghur has the two tag question markers sundaq- mu and sundaq-qu that contain the 
polar question marker or another unidentified marker in combination with a demonstra- 
tive. 


(412) Uyghur 
ular bügün kál-má-ydu, Sundaq=qu? 
3PL today come-NEG-3NPST just.so=Q 
"You are not coming today, right?’ (Tuohuti Litifu 2012: 322) 


Topic questions such as sán-cu? ‘what about you?’ exhibit a marker -ču (Tuohuti Litifu 
2012: 218) that is probably cognate with the forms seen before in Kazakh and Kyrgyz. The 
form dubitative m'ikin ‘is it?, may it be?’ (Shaw 1878: 81), is cognate with Kyrgyz beken 
and "expresses more of hesitancy between two opinions than the simple mu" (Shaw 1878: 
56). 

One of the least known Turkic languages in NEA is probably Ili Turki (Zhao Xiangru & 
Aximu 1985). There are only a handful of examples for questions. But these are sufficient 
to illustrate the question marker =MA. 


(413) Ili Turki 
a. bar-di-ma? 
go-PsT-Q 
‘Did (she) go?’ 
b. bar-di-q=pa? 
gO-PST-1PL=Q 
‘Did we go?’ (Hahn 1991: 31) 
Table 5.136 illustrates that the difference from surrounding Turkic languages is mostly 
phonological in nature. 


Table 5.136: A comparison of three interrogative sentences in six Turkic lan- 
guages (Zhao Xiangru & Hahn 1989: 278f., slightly modified) 


‘Did he give?’ ‘Did you see?’ ‘Did we go?’ 
Ili Turki berdi-mà? kordür-bà? bardiq-pa? 
Kazakh berdi-me? kórdig-be? bárdiq-pa? 
Kyrgyz berdi-mi? kórdür-bü? bardiq=p1? 
Uzbek berdi=mi? kordin=mi? bardiq=mi? 
Xinjiang Uzbek berdi=mi? kérdin=mi? bardiq=mi? 
Uyghur bardi=mu? kórdür-mu? barduq=mu? 


Uzbek has a polar question marker =mi, which is “accompanied by a rising pitch in 
the preceding syllable.” (Boeschoten 1998: 373). The same marker is present in Uzbek as 
spoken in Xijiang (Table 5.136). As in Uyghur, the marker has one form only. It can attach 
to different word classes. 
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(414) Uzbek 
a. mümkin-mi? 
possible=Q 
‘Is it possible?'?* 
b. futbol oyin-ga ` bar-aylik=mi? 
football match-pAT go-vor-Q 
"Shall we go to the football mtach?' (Boeschoten 1998: 360, 368) 


When second person agreement forms are present, the question marker may either pre- 
cede or follow the agreement marker. 


(415) Uzbek 
a. sáy-mi-siz? 
good-Q-2PL 
"Are you well?’ 
b. kéla-siz=mi? 
come-2PL=Q 
"Will you come?’ (Boeschoten 1998: 373) 


By now this phenomenon should be familiar from several languages seen before. As ex- 
pected, alternative questions take two markers and content questions remain unmarked. 
Note the alternative question following a content question (§4.4). 


(416) Uzbek 
a. ména  qaysisi yaras-adi, —qizili-mi, áqi-mi? 
1sG.DAT which  suit-?cop.PsT red=Q white-Q 
"Which one suits me, the red one or the white one?' 
b. kecà ^ kim-lar kel-di? 
tonight who-PL come-PsT 
"What persons have arrived tonight?' (Boeschoten 1998: 371) 


From the Siberian branch ten southern and two northern languages are included in 
this study. Let me first address the southern subbranch. Tuvan has the expected polar 
question marker be. But there is a marker -Il that appears to be developing into a content 
question marker, e.g. kim-il? “Who is it?’ (Anderson & Harrison 1999: 88f.). The marker 
is optional as there are also unmarked content questions. 


(417) Tuvan 


a. xlep bar=be? 
bread cop=Q 


‘Is there bread?’ 


34This may also be pronounced as miimkimmi. 
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b. kayiin kel-gen siler? 
whence come-PsT.I 2PL 


"Where have you come from?' (Anderson & Harrison 1999: 69, 28) 


The situation in Dzungar Tuvan spoken in China is almost identical. But the polar 
question marker is -BA with vowel harmony as well as variation of the consonant, both 
of which are absent in Tuvan proper. 


(418) Tuvan (Dzungar) 

a. ol gel-di-be? 
3sc come-psT.II=Q 
‘Has (s)he come?’ 

b. sen cazan d3or-ur-sen? 
2sc when leave-?ruT-2sc 
"When will you leave?' 

c. suu-bas-tan birsi qajda-l? 
water-1PL.POSS-? ABL some where-Q 


‘Where has some of our water gone?’ (Wu Hongwei 1999: 113, 144, 54) 


Wu Hongwei (1999: 146) mentions that questions may exhibit a certain question intona- 
tion, but leaves open the details. 

Basically the same pattern seen in the two varieties of Tuvan above can also be found 
in Dukhan. Again, the polar question marker =BA has more varieties than in standard 
Tuvan and the content question marker is optional. 


(419) Dukhan 

a. bata gel-gen-be? 
PN come-POST-Q 
‘Has Bat arrived?’ 

b. gim gel-gen? 
who come-POST 
"Who has come?’ 

c. am pis ganjaar-al? 
now 1PL which.v.INTRA.LF-Q 
‘And now what should we do?’ (Ragagnin 2011: 193, 131, 188) 


In Dukhan, the content question marker Jl is said to have an additional intensifying 
function, which would explain its absence in some sentences in Tuvan as well. 

A negative tag question in Dukhan has the form of a negative copula followed by the 
polar question marker. 
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(420) Dukhan 
gel-gen ^ emes-pe? 
come-POST NEG-Q 
‘(S)he arrived, didn’t (s)he? (Ragagnin 2011: 187) 


This pattern has the form eves-be in Tuvan (Harrison 2005: 23) and emes-pe in Dzun- 
gar Tuvan (Mawkanuli 2005: 209). In Sarikoli there is a similar question tag emes he: with 
a question marker that is said to also occur in Uyghur (Tooru et al. 1999: 31-32). There 
is an areal connection of this construction to similar constructions in Mongolic (e.g., 
Mongolian bish=uu, §5.8.2), as well as other Turkic languages (e.g., Turkish degil=mi, 
see above). Tag questions in Tuvan have the final element ale, Dukhan has hala ~ haran 
(Ragagnin 2011: 187). 

Tofa (previously also called Karagas), like Tuvan, has the invariable marker =be (Schó- 
nig 1998: 414). According to Castrén (1857b: 71):, there is a variation between two forms 
-bé ~ -pé that can attach to different word classes, which is why they can be considered 
enclitics. Alternative questions, for which no example was given, take the same marker 
on each alternative. Content questions have a marker -(u)l that is cognate with Dukhan 
-(DI etc. 

(421) Tofa 
a. onu soodap beer-be? 
3SG.ACC say.CVB OBJ.VERS.P/F=Q 
"Should I say it again (for you)?' (Anderson 2001: 260) 
b. ad-in qum-ul? 
name-2sG.POSs who-Q 
"What is your name?' (Schónig 1993: 199) 


Question marking in Tofa is thus almost identical to Tuvan. 

For Khakas (previously also called Koibal), Castrén (1857b: 71) mentions a polar ques- 
tion marker -BA (i.e., -ba, -bd, -pa, -pä) that should be considered an enclitic as well. There 
are additional variants with an initial nasal not mentioned by Castrén (Anderson 1998: 
passim). Alternative questions take two markers and content questions are unmarked. 


(422) Khakas 


a. srrer xakas-ta-p cooxta-péa-zar-ba? 
2PL PN-V-CVB speak-PRS-2PL=Q 
‘Do you (pl.) speak Khakas?' 
b. kem pil-er ani? 
who know-FUT 3sG.AcC 
"Who knows him/her?' (Anderson 1998: 87) 


The following polar and focus questions were given to me in January 2016 by a na- 
tive Khakas living in Germany with the help of several Khakas speakers in Russia. The 
transcription and analysis roughly follow Anderson (1998). 
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(423) Khakas 
a. sin tan.da, škola par-éa-zyn? 
2sc tomorrow school go-PRs-2sc 
‘Are you going to school tomorrow?’ 
b. sin tan.da, škola par-¢a-zyn? 
2sG tomorrow school go-PRs-2sc 
‘Is it you who is going to school tomorrow?’ 
c. tan.da, sin škola par-¢a-zyn? 
tomorrow 2sc school go-PRs-2sc 
'Is it tomorrow that you are going to school?' 
d. sin škola, tan.da par-ča-zyņ? 
2sG school tomorrow go-PRs-2sG 


‘Is it to school that you are going tomorrow?’ 


No polar question marker was present. Detailed information on intonation is not avail- 
able to me, but focus is also clearly marked with word order, especially sentence initial 
position. In one instance the focused element stands in second position. 

However, Khakas has a special interrogative verb ending -3An found in content ques- 
tions that expresses "semi-rhetorical utterances like ‘how is it possible that...?', when the 
speaker believes what they are questioning to in fact not be possible or appropriate" (An- 
derson 1998: 38). It often combines with interrogatives such as noya ‘why’, xaydi ‘how’ 
or xaydar ‘whither’. 


(424) Khakas 
xiyir-yan čir-zer — xaydi par-ba- ag? 
invite-PPsT place-ALL how  go-NEG-Q 


‘How is it possible not to go where one was invited?’ (Anderson 1998: 38) 


Given that -2A7 originally expressed the habitual past, a connection to Samoyedic lan- 
guages seems possible (§5.12.2). In Enets and Nenets, for example, the suffix -sa used to 
be a past tense marker but acquired an interrogative meaning as well. A difference is 
that it can be found in both polar and content questions. 

According to Hu Zhenhua & Imart (1987: 29) Fuyu has a question marker -BA that 
has at least eight different realizations, ba, bi, pa, pi, ma, mi, Ba, and fi. They write the 
question marker attached to the preceding word, but it has been analyzed as enclitic here 
as it can be attached to a verbal or non-verbal host. Content questions remain unmarked, 
e.g. ol gim? ‘Who is (s)he? (Hu Zhenhua & Imart 1987: 37). 


(425) Fuyu 
ol sin-in balan=ma? 
3sG 2SG-GEN son=Q 


‘Is he your son?’ (Hu Zhenhua & Imart 1987: 29, 37) 
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In Sarig Yughur the polar quesion marker has the form =mi and an older variant =pi 
after plosives as well as a stressed version =m (Roos 2000: 152). It has the reduced form 
-m after the past tense suffix -(")ti and the evidential -ti. 


(426) Sarig Yughur 
a. yahs=mu? 
good-Q 
‘How are you?’ (a greeting) 
b. sen uzi-"ti-m? 
2sc sleep-PST-Q 


"Did you sleep?' (Roos 2000: 152) 


The variant =mu as well as the suffix -m indicate strong influence from Uyghur. The dif- 
ference from Uyghur only concerns the lack of a person marker following -m, but this 
category is altogether absent from Sarig Yughur (Roos 2000: 100). In the first two exam- 
ples =mi is also possible. According to Chen Zongzhen (1982: 76) the question marker 
has the slightly different form =be, =me ~ -m, but no example of the former has been 
given. The distribution of =me and -m confirms Roos’ description. (Negative) alternative 
questions take two question makers. 


(427) Sarig Yughur 
ko ki-sti-m ki-yimsti-m? 
3sG come-FUT.EV-Q come-FUT.NEG.EV-Q 
"Will (s)he come or not?' (Roos 2000: 152) 


Roos mentions another example of an alternative question with a very unusual struc- 
ture. If there is no mistake, the sentence has a question marker at the end of the sentence 
in addition to the two markers already present. Moreover, there is a disjunction that fol- 
lows both alternatives. 


(428) Sarig Yughur 
sen puyin-mi, tayin-mi ta" qi c'unSan-qa kuncuola-s=mi? 
2sc today=Q tomorrow=Qor  chief-paT work-rur-Q 
"Will you work for the chief today or tomorrow?’ (Roos 2000: 152) 


In Shor the question marker has the variants =ba, =be, =pa, =pe, =ma, =me (Donidze 
1997: 505), which is identical to Altai Turkic. Polar questions take one (Nevskaja 2000: 
291), content questions are unmarked, alternative questions take two markers. 


(429) Shor 


a. kem kel-du? 
who come-PsT 


"Who came?’ 
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b. ak kazyn ózer-be ¢ok=pa? 
white ?birch ?grow=Q NEG=Q 


‘Is the white birch tree growing or not? (Donidze 1997: 505) 


The question marker in Altai Turkic has the same variants =ba, =be, =pa, =pe, =ma, 
-me as in Shor (Baskakov 1997: 183). In addition there is an enclitic =na ~ =ne. The dif- 
ference in meaning has not been specified, but the latter marker may have dubitative 
meaning. 


(430) Altai Turkic 


a. albyn=ba? 
?receive=q 


‘Have you received?’ 


b. olor kel-gen=ne? 
3PL come-?P.PST=Q 


‘(I wonder whether) they came?’ (Baskakov 1997: 183) 


Content questions appear to be unmarked, e.g. kajda sin? "Where are you?’ (Baskakov 
1958a: 104). 

The newly identified South Siberian Turkic language Chalkan offers a picture that 
strongly resembles its closely related languages such as Altai Turkic. Polar questions 
take a sentence-final question marker and content questions remain unmarked. 


(431) Chalkan 

a. eme t'etire uyuqta-p-sin=mi? 
now till sleep-PRs-2sc-Q 
‘Have you been sleeping till now?’ 

b. te, qanéin-dan — kecirsin? 
well how.much-as1 ferry.PRs.2sG 
"Well, how much is it for you to ferry us (over the river)?’ (Nevskaja 2014: 
78) 

c. kem-nin tiz-i apaas’ aa-n=ma_ meen=me? 
who-GEN tooth-3sc.Poss rather.white 3sG-GEN=Q 1sG.GEN-Q 
"Whose teeth are especially white, his/hers or mine?' (Erdal et al. 2013: 98) 


The exact variants of the question marker remain unclear. 

Little material is also available for questions in Chulym Turkic (also called Os), but 
some material based on fieldwork has been published by Harrison & Anderson (2003) 
and Anderson & Harrison (2006). Unfortunately, their data do not contain an example of 
a polar question, but there are several content questions as well as one open alternative 
question in which the second part is ili qajdury? or what?’ (Anderson & Harrison 2006: 
57). The disjunction ili as well as the accompanying construction has been borrowed 
from Russian. Content questions are unmarked. 


350 


5.11 Turkic 


(432) Chulym 
kajnaar bar-eydi-n? 
whither go-pRs-2sc 
"Where are you going?’ (Harrison & Anderson 2003: 250) 


There is no information on intonation either. Anderson & Harrison (2004: 184) men- 
tion an example of a polar question provided by a Middle Chulym speaker, i.e. uluy=be? 
"Was it big?'. But apparently, the sentence represents an example of code mixing with 
Tuvan. The same question marker also exists in Chulym, but has the form -ba according 
to Birjukovich (1997). 


(433) Chulym 
män par-ad-ym=ba? 
1sG go-PRS-1sG-Q 
‘Do I go?’ (Birjukovich 1997: 496) 


Perhaps the form =be was simply borrowed from Tuvan. However, according to the data 
by Li Yong-Sóng et al. (2008: 97) in Middle Chulym, apart from =ba, there is a second 
variant =bd that follows front vowels and could be identical to =be seen above. 


(434) Chulym (Middle) 
seer) eer-in üp-te-bà? 
2sG.GEN husband-2sc.Poss house-Loc=Q 
‘Is your husband at home?’ (Li Yong-Sóng et al. 2008: 98) 


But interestingly, the question marker does not necessarily stand sentence-final, but 
may be followed by other elements. The description by Li Yong-Sóng et al. (2008) is 
insufficient for a clear analysis. The example is not a focus question in which the mobile 
question marker attaches to the focused element. Instead, one element from the sentence 
simply follows the predicate, which is the host for the question marker. Perhaps this is 
a focus question in which focus is expressed with the help of word order. The following 
sentence was translated as ‘Do you have good news?’, which seems to be inadequate. 


(435) Chulym (Middle) 
siler-nin caqsi-ba vest’? 
2PL-GEN good-Q news 


‘Is your news good?’ (Li Yong-Sóng et al. 2008: 210) 


In Yakut and Dolgan, the two northern Siberian Turkic languages, there is an enclitic 
=duo ~ =duu (Stachowski 1993: 83; Ebata 2011: 197) that marks polar and alternative ques- 
tions and another marker -(n)rj (Ebata 2011: 197) or -Iy (Stachowski & Menz 1998: 423), 
showing vowel harmony, that is found in content questions. The first marker is unique 
among Turkic languages. It can also be found in Kolyma Yukaghir (Nagasaki 2011: 245) 
but does not exist in Tundra Yukaghir (see §5.14.2). 
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(436) Yakut 
a. kyoerasej wrura-tur-n ist-e-sin=duo? 
lark song-3sG.POSS-ACC hear-PRs-2sG-Q 
‘Do you hear the song of larks?’ 


b. oEo-7 kuius-duu uol-duu? 
child-2sc.poss daughter.cop.3sG=Q son.coP.3SG=Q 


‘Is your child a daughter or a son?’ 


c. xanna baar-guin-urj ? 
where exist-COP.2sG-Q 


"Where are you?' (Ebata 2011: 197) 


(437) Dolgan 


a. taahik-ka uu  baar-duo? 
washbasin-DAT water EX=Q 


‘Is there water in the washbasin?’ 
b. tugunan bar-ia-n-iy? 

what.INST go-FUT-2sG-Q 

"What will you go with?' (Li Yong-Sóng 2011: 62, 90) 
c. ikki-duo üs-duo cahy? 

two=Q four=Q hour 

‘for two or four hours?’ (Stachowski 1993: 83) 


Table 5.137 Summarizes question marking in Turkic languages in polar and content 
questions. Except for Salar, Tuvan, Dukhan, Yakut, and Dolgan, the Turkic languages 
in the sample have unmarked content questions. Apart from phonological differences, 
there is very little variation in polar question markers. Only Yakut and Dolgan deviate 
from the rest of the languages, possibly due to Yukaghiric influence. The content ques- 
tion marker in these Turkic languages has an areal connection to the so-called Mongolic 
"corrogative" particle "büi (85.8.2). Similar to Mongolic, the Turkic markers may have 
their origin in a copula form. For Tuvan, Anderson & Harrison (1999: 88) speculate that 
-Il is a quasi-copula that derived from the demonstrative ol ‘that’, which is also the source 
ofthe third person singular pronoun (see also Ragagnin 2011: 188). In fact, a development 
from demonstrative to copula is widely attested, for example for Chinese shi or Rus- 
sian eto/ato. No Turkic language from the sample marks polar and content questions 
in the same way. Those languages for which sufficient data are available indicate that 
alternative questions are usually marked in the same way as polar questions but often 
exhibit an optional disjunction as well. Altogether there is little variation in polar ques- 
tion markers among Turkic languages. Except for Yakut and Dolgan -duo - -duu and 
maybe Salar -U, all polar question markers listed in Table 5.137 appear to be cognate 
with Turkish =mI. The major difference lies in how many variants the question marker 
has in a given language. As has often been observed, a very likely origin of the question 
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marker lies in negation, e.g. Turkish -mA or Old Turkic -mA (Erdal 1998: 151). The po- 
lar question marker was already present in Old Turkic and in Chagatay as =mU, with 
content questions unmarked (Erdal 1998: 152; Boeschoten & Vandamme 1998: 171, 175). 


Table 5.137: Summary of question marking in Turkic 


PQ CQ 
Altai =MA# - 
Chagatay =mU - 
Chalkan =MI# - 
Chulym (Middle) =bA# - 
Dolgan =duo# V-ij 
Dukhan =BA# V-(DI 
Eynu =mu# S 
Fuyu =BA# S 
Ili Turki =MA# ?- 
Kazakh =MA#, V-MI - 
Khakas =BA# - 
Kyrgyz =Bl# - 
Old Turkic =mU - 
Salar =mU#, =U# V-i 
Sarig Yughur =Mi#, =mu# - V-m - 
Shor =MA# - 
Tatar (Chinese) =ml# - 
Tofa =be# V-(u)l 
Tuvan =be# VU 
Tuvan (Dzungar) =BA# V-l 
Uyghur =mu# ~ V-m - 
Uzbek =mi# - 
Yakut =duo# ~=duu# V-(n)ij 


However, in Chuvash, the only extant Oghur language and the most aberrant Tur- 
kic language, there are three question markers that are usually written attached to the 


preceding word, which takes an intonational peak. 


(438) Chuvash 


a. văl yula-na kay-nă-i? 
3sc city-DAT go-PST-Q 
"Did (s)he go to the city?' 
b. val yula-na kay-ná-im? 
3sG city-DAT go-PST-Q 
‘Did (s)he really go to the city?’ 
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c. val yula-na kay-ná-si? 
3sG city-DAT go-PST-Q 
"You mean (s)he went to the city?' (Clark 1998: 450) 


While -i marks plain polar questions, -im is said to express uncertainty or surprise, 
and sentences marked with -ši express doubt and are in need for further confirmation 
(Clark 1998: 450). Alternative questions take two plain polar question markers and an 
optional disjunction e. Content questions are usually unmarked but may also take the 
marker -si. In example (439a) an alternative question follows a content question ($4.4). 


(439) Chuvash 

a. esir asta  purün-a-tür, xula-ra-i (e) yal-ta-i? 
2sG.POL where live-PRs-2sG.POL city-Loc-o or village-Loc-o 
"Where do you live, in the city or in the village?' 

b. muzey ăsśta-ši? 
museum where-Q 
‘Where is the museum?’ 

c. esir asta śural-nă? 
2sG.POL where be.born-PF 
"Where have you been born?’ (Landmann 2014: 34, 62) 


These data show a pattern markedly different from those Turkic languages located in 
NEA. While double marking and disjunctions are attested in other Turkic languages as 
well, there is no formal match to Chuvash. Furthermore, no other Turkic language inves- 
tigated here has a question marker that can appear in both polar and content questions. 
However, there was an additional dubitative marker (Old Turkic erki, Chagatay e(r)ken 
~ é(r)kin) that could also appear in content questions. 

Descriptions of Turkic languages rarely explicitly state the syntactic behavior of in- 
terrogatives. Often, for instance in Uzbek and Kazakh, interrogatives are found in focus 
position in front of the verb (Boeschoten 1998: 373; Muhamedowa 2016: 20). 


5.11.3 Interrogatives in Turkic 


Turkic languages have both KIN- and K-interrogatives. The Proto-Turkic interrogative 
meaning ‘who’ has been reconstructed as "kem by Róna-Tas (1998: 74) or as “kam for 
Oghur (Chuvash kam), and *kim otherwise by Schónig (1999: 64, 69). Oghuz languages, 
but not Salar, differ from other Turkic languages in having derivations from the interrog- 
ative ne ‘what’ instead of "qay- for locative forms (Schónig 1999: 66). In fact, in modern 
Turkish most interrogatives start with an n- (Góksel & Kerslake 2005: 251). In Turkic, 
however, the interrogative meaning ‘what’ (Turkish ne) was originally the only native 
word starting with an n- (e.g., Róna-Tas 1998: 74). This phonotactic anomaly might in- 
dicate borrowing from another language. Comparable forms in Northeast Asia exist in 
Ainuic, Eskaleut, Japonic, Sinitic, and Yukaghiric languages, but none is a very likely 
source of the Turkic interrogative. Stachowski's (2015) claim that "ne derives from a 
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Uralic demonstrative is not very convincing and deserves further evidence. The Uralic 
language Selkup has several interrogatives that have a Turkic appearance and were prob- 
ably borrowed (§5.12.3). 

Many Turkic languages have a difference between a plain velar plosive in ‘who’ and 
an uvular plosive in other interrogatives. A similar phenomenon is known from some 
Mongolic languages (§5.8.3). However, the distribution of the resonances k~, n~, and q~ 
more closely resembles Yukaghiric languages (§5.14.3). Table 5.138 gives an overview 
of cognates of five Turkic interrogatives. In most languages the interrogatives ‘which’ 
and ‘where’ are analyzable synchronically and thus do not qualify as so-called basic- 
level interrogatives. Siberian Turkic languages contain an innovative form combining 
the meanings ‘how’ and ‘which’ that exhibits variation between -n- (e.g., Tuvan kan-dig) 
and -y- (e.g., Yakut yay-day). This variation is already attested in Old Turkic kañu ~ kayu. 
There is some overlap with another group of languages that exhibit an older derivation 
with a suffix -si that is even attested in Chuvash xd-5(é) and Khalaj qani(-si). Apart from 
the latter, these are usually based on the variant with -y- (e.g., Tuvan kay(i)-zi). 

Stachowski (1990; 2015; p.c. 2016) has quite convincingly argued that Yakut tuoy ‘what’, 
apart from Dolgan tuok ~ tuogu, seems to have no cognates in other Turkic languages. 
However, while there are certain phonological problems, there might actually be cog- 
nates in other Siberian Turkic languages (Table 5.139). Note, first of all, that there are 
additional forms that must be connected with tuoy in Yakut. These are the forms mean- 
ing ‘why’ (toyo) and ‘how many’ (tóhó). Despite their apparent differences, they share a 
resonance in t~, which suggests a common origin. Second, in both Yakut and Dolgan, as 
well as the other Siberian Turkic languages, these forms are by and large restricted to 
these three functions. Third, apart from the question of whether there is a sound law that 
connects the Yakut and Dolgan forms with the interrogatives from the other languages, 
they certainly have a similar overall form. Fourth, the fact that all languages are from the 
same branch of Turkic makes it very plausible to seek a common origin for this anomaly. 
Fifth, the forms meaning ‘what’ and ‘why’ in all languages have a different vowel quality 
than the form meaning ‘how many’. 

Tuvan ¢ii-den and Karagas (Tofa) tü-dán ‘why’ contain an ablative instead of a dative 
(Anderson & Harrison 1999: 28; Castrén 1857b: 163). The derivation of some forms such 
as Chalkan t’iig(g)erek, t'ugerek, t'uriq ‘what’ remain unclear to me. There is also an 
interrogative verb ‘to do what’ in Tofa (Coon-) and Chalkan (t'uvet-). 

Stachowski (2015) tried to connect the Yakut form with an Uralic demonstrative stem, 
which is possible but unlikely from a typological perspective. Interrogatives and demon- 
stratives may share paradigmatic similarities and may also grammaticalize into similar 
categories such as relatives, but demonstratives do not usually develop into interroga- 
tives. That this change occurred during borrowing, which in itself is not the most likely 
scenario, is not very plausible either. The only similar form in terms of both meaning and 
form that I was able to find in NEA can be found in Iranian languages (e.g., Sogdian (a)éu 
‘what’). However, a connection in terms of borrowing seems too far-fetched. According 
to Stachowski (2015: 85), tuoy goes back to “to-ok, in which the suffix is an intensifier. 
Possibly, the suffix can be compared with Altai d'u-y ~ cu-y and Chalkan t'ü-y ~ t'u-y/g. 
The other Turkic languages with the unusual interrogative show a palatalized consonant 
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Table 5.138: Cognates of five Turkic interrogatives; Chaghatay taken from 
Boeschoten & Vandamme (1998: 171, 173), Chuvash from Landmann (2014: 32f.), 
Dukhan from Ragagnin (2011: 94), Khalaj from Doerfer (1988: 107f.), and South 
Siberian Turkic partly from Schónig (1998: 410); see the rest of this chapter for 
additional variants 


who what which when where 
Turkish kim ne hangi ne zaman nere, hani 
Chuvash kam mén xa-8(é) xasan asta 
Khalaj kim, kim näy qani(-si) qa cCàn nīrä 
Chaghatay kim ne qay(u)(-si) qacan qani, qan-da, 
qay-da 
Old Turkic kam, kim na kafiu, kayu kacan kafiu-da, 
kayu-da 
Salar kam nang ga-si gaqiang  ga-da 
Tatar kim ni qaj(-sa) qajtfan ^ qaj-da 
Kazakh kim ne, nemene qaj-se qafan qaj-da 
Kyrgyz kim emne qaj-se qatfan qaj-da 
Uyghur kim nemá qay-si qacan na 
Ili Turki kim nema qay-si qacan 
Kashgar kim-tfim nime qa(j)-si qatfan nee 
Uyghur 
Eynu kim nime qaj-si qatfan ne 
Uzbek kim nima qay-si qacàn qani, qayér-dà, 
qaya-ta 
Tuvan kim čüü kay(i)(-zi), kažan kay-da 
kan-dig 
Dzungar cəm dzy-dzi-me caj(2)-so, cazan caj-da 
Tuvan can-doy 
Tofa qum čü qaj-sy, qaj-da 
qan-dyy 
Karagas kèm, kum tü kan-deg kaśan, kai-da 
(Tofa) kähän 
Dukhan gim jà(ü) gae 
Khakas kem nime xay(-zi), xay$an ` xay-da 
xay-day 
Koibal kem, kim nô, néma, nime kai-ze, kaden kai-da 
(Khakas) kai-dak 
Fuyu gim nem cay-zi, ca-dah cajan cay-da 
Shor kem noo kaj(y) qacan kaj-da 
Sarig Yughur_ kim ni qay-si qa"tan ` qay-ta 
Altai kem d'u-y-écu-y,ne  kan-dyj kacan kaj-da 
Chalkan kem t'u(u), t'ü-y - qan-diy, qazan qay-da 
t'u-y/g, ne qan-du(y) 
Chulym kim do nóómà qay-diy qacan ~  qay-da 
qajan 
Yakut kim tuo-x xaya, xay-dax xahan xanna 
Dolgan ki(i)m tuo-k- tuo-gu kaja, kaj-da(aą)ìk ^ kahan- kanna 
kagan 
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Table 5.139: A tentative list of cognates of a possible interrogative stem in 
Siberian Turkic (except for Abakan); not all forms are shown 


what why how many 
Altai d'u-y ~ éu-y 
Dukhan jà(ü) 
Chalkan t'u(u), t'ü-y ~ t'u-y/g 
Chulym tfio 
Karagas (Tofa) tü tü-gä, tü-dän teśe, tehe 
Tofa čü čehe 
Tuvan čüü čü-ge, čü-den čeže 
Tuvan, Dzungar dzy-dzime dzy-ge, dzy-nen dzeze 
Dolgan tuo-k ~ tuo-gu to-go ~ tuo-go tóhó 
Yakut tuo-x to-yo tóhó 


instead of t-. Most likely this is the result of the following high vowels. The reason for 
the apparent irregular development in Yakut tuoy and Dolgan tuok is not perfectly clear, 
but one possibility would be an analogy to the negative existentials, Yakut suoy and Dol- 
gan huok, that have a regular development. Possibly, the question marker =duu - =duo 
in Yakut and Dolgan derives from the same source (INT > Q), but this likewise remains 
somewhat speculative. More research by experts of these languages will be necessary to 
clarify these points. 

For Old Turkic, several partial interrogative paradigms are attested. Table 5.140. com- 
pares some of them with Sarig Yughur, for which paradigms were given by Roos (2000). 
Apart from phonological changes there are only minor differences between the two lan- 
guages, which illustrates the relatively young age of Turkic. 

For reasons of space, paradigms will not be given in detail for other languages through- 
out this section. 


Table 5.140: Old Turkic interrogative paradigms (Erdal 2004: 211) in comparison 
with Sarig Yughur (Roos 2000: 87) 


who which 
Old Turkic Sarig Yughur Old Turkic Sarig Yughur 
NOM kim, kam Hmm kayu, kanu qay-si 
ACC kim-ni kim-ni kayu-ni qay-si-n 
GEN kim-(n)in k*im-nin kayu-nur - 
DAT kim-kà, kàm-kà | k"im-ki kayu-ka qa-ya 
ABL - kim-tin kayu-din qay-tan 
LOC kim-tádà - kayu-da, kafiu-da | qay-ta ~ qan-ta 
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Let us now consider interrogatives from individual modern languages. The order will 
be roughly the same as in §5.11.2, starting with the only Oghuz language Salar (Ta- 
ble 5.141). If available, several descriptions are contrasted for any given language. In 
some cases only a selection of forms is given. 


Table 5.141: Salar interrogatives (Ma Quanlin et al. 1993: passim; Lin Lianyun 
1985: 52, 109, 136, passim); some questionable variants were excluded 


Ma Quanlin et al. Lin Lianyun 
who kam kem 
where gada cada 
whither cala 
whence caden 
when gaqiang, gahao cadzar, cahal 
which gasi cajsi 
for what naima 
what, how etc. naiqi 
how naiqiu 
what nang nar) 
why neigei neye 
how (long) neisi 
how much/many neisiqiu nehdze 


Similarly to Turkish and Tatar, relatively many of the interrogatives start with n~. Only 
kam ~ kem ‘who’ has an initial k, while all other forms start with g ~ c. Lin Lianyun (1985: 
76) in addition mentions an interrogative verb naxcur ‘to do what’ that is claimed to be 
a contraction of nan et-gur with the definite future marker. With other verb endings the 
periphrastic construction is still present. 


(440) Salar 
sen nay et-bar-i? 
2sc what do-PROGR.INDEF-Q 
"What are you doing?' (Lin Lianyun 1985: 86) 


Interrogatives from the Kipchak languages Tatar, Kazakh, and Kyrgyz are listed in 
Table 5.142. The languages all have a similar resonance pattern with the interrogative 
‘who’ being the only one that does not show q- or n-. Tatar is probably no exception, 
although in Cyrillic transcription the forms all start with x~. This has been transliterated 
with q- before an a. Only Kyrgyz emne is an exception from the resonances. Most likely 
it is an allegro form that developed from a form similar to Kazakh nemene. For compar- 
ative purposes, Table 5.142 also contains forms from Tatar proper, transliterated from 
Cyrillic.’ Tatar nerse derives from ni erse (Chen Zongzhen & Yi Liqian 1986: 80). 


35Some forms seem to be pronounced slightly differently, e.g. nindi/uumy was given as /nindey/. 
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Table 5.142: Interrogatives from Chinese Tatar (Chen Zongzhen & Yi Liqian 
1986: 34, 79f., 185), Tatar (Poppe 1963: 81f., 219, 234f. passim), Kazakh (Geng 
Shimin & Li Zengxiang 1985: 53, 103, 172, 238), and Kyrgyz as spoken in China 
(Hu Zhenhua 1986: 58, 251) 


Ch. Tatar Tatar Kazakh Kyrgyz 
who kim kem kim kim 
what ni, nerse nii, närsä ne, nemene emne 
why nik, nige nik, nigä nege, ne yfin emne ytfyn 
how many/much nitfe, qantfa ~ qantfe nica nefe(w), qanfa netfe(n), qantfa 
how nitfik niéek qalaj- qandaj qandaj 
what (kind of) nindij nindi 
which qaj, qajsa qaj, qajsy  qajse qajsə 
where (to) qajda, qaj dzer qajda qajda qajda 
whence qajdan qajdan qajdan qajdan 
when qajtfan qajčan qafan qatfan 


Table 5.143: Uyghur (Tuohuti Litifu 2012: 367; Mi Haili 1997: 83), and Uzbek 
interrogatives (Boeschoten 1998: 373; Landmann 2010: 24) 


Uyghur Kashgar Uzbek Uzbek 
Uyghur (Boeschoten) (Landmann) 

who kim kim ~ tfim kim kim 

where na nee : à 

how much näččä netftfe neéa ~ necha, nechta 
nec(a)ta (attr.) 

what nema nime nima ne, nima 

why nemiáqa ~ nimifqa naga, nimaga, nega, nimaga, 

nemiska náàcük, nima nima uchun 

ücün 

when qacan qatfan qacan qachon 

how qandaq qajdaq ~ qalay, qanday, qani 

qandaq qanaqa 

how much/many  qancá qantfe qanéa qancha 

where qeni, qàyár qeni, qeje(r) qani, qayerda, qayerda, 
qayata qatta 

which qaysi qa(j)si qaysi qay(si) 

whither qayerga, qayerga 
qayaqqa 

whence qayardin qayerdà, qayerdan 
qayaqdan 
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Kazakh ne yfin and Kyrgyz emne yt/yn, like Uzbek nimà ücün, literally mean ‘what for’. 
Plural forms in Kazakh are formed by reduplication, e.g. kim kim ‘who (plural)’ (Geng 
Shimin & Li Zengxiang 1985: 54). This pattern that is also found in Uzbek, for example, 
has parallels in the Amdo Sprachbund. 

Uyghur nd ‘where’, or its dialectal Kashgar form nee, is an innovation also found in 
Eynu ne that might be connected to Turkish nere and Khalaj ni*rá ‘where’. Uzbek naga 
‘why’ has cognates in Tatar nigd and Kazakh nege and in some Siberian languages such 
as Khakas noya or Fuyu nozo and is an old dative form. The Uzbek interrogative nimaga 
‘why’ has the same basis but is more readily analyzable as the form nima ‘what’ still 
exists. The dative can also be found in gayérga ‘whither’. Both Eynu and Ili Turki forms 
are almost completely identical to Uyghur (Table 5.144). 

Interrogatives from the Sayan subbranch of Southern Siberian Turkic languages have 
been collected in Table 5.145. Ragagnin (2011: 94) only mentions the three Dukhan inter- 
rogatives gim ‘who’, jü(ü) ‘what’, and gae ‘which’. 

Abakan is the only subbranch of Siberian Turkic that lacks the special interrogative 
that might be cognate with Yakut tuoy. Table 5.146 summarizes all forms available for 
Khakas, Fuyu, as well as Shor and compares them with Sarig Yughur. Sarig Yughur inter- 
rogatives are rather different from other Abakan languages. Altogether there are more 
forms starting with an n-. 

Table 5.147 presents data from Chulym and Altai Turkic languages. Altai has a dialectal 
difference between southern ne ‘what’ and northern d'uy ~ éuy ‘what’ (Baskakov 1958b: 
15). For Chulym, Anderson & Harrison (2006) and Harrison & Anderson (2003) have the 
form tfo for Middle Chulym, but Birjukovich (1997: 493) mentions nömä instead, which 
was given as nöömä by Li Yong-Sóng et al. (2008). Chalkan furthermore has a verb t’uvet- 
‘to do what’ and the Russian interrogative qaqoy ~ kakoy ‘what kind of’. 

Northern Siberian Turkic interrogatives (Table 5.148) have two resonances, t- and k-. 
The latter changed to y- in Yakut. There is no resonance in n-, which stands in stark 
contrast even with several Southern Siberian Turkic languages. Yakut yanna and Dolgan 
kanna ‘where’ are amalgamated forms that go back to a locative form with an -n- instead 
of a -y-, cf. Sarig Yughur qay-ta - qan-ta. The similarity to Mongolic languages such as 
Khamnigan Mongol kaana or Buryat xaana ‘where’ is thus due to chance. Yakut yas and 
Dolgan kas ‘how much’ seem to have a cognate in Tuvan qas. 
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Table 5.144: Ili Turki (Hahn 1991: passim) and Eynu interrogatives (Lee-Smith 
1996a: 857; Zhao Xiangru & Aximu 2011: 79f., 316, 338). 


Ili Turki Eynu (LS) Eynu (ZA) 
who kim kim ki 
how many näččä netftfe 
what nemä nime neme, qaj 
where ne ne 
when qačan qatfan qatfan 
how much/many qanča qantfe 
how qandaq qandaq qandaq 
which qaysı qajsi qajsi 


Table 5.145: Russian Tuvan (Anderson & Harrison 1999), Dzungar Tuvan (Wu 
Hongwei 1999: 42, 231), Tofa (Rassadin 1997: 381), and Karagas (Tofa) interrog- 
atives (Castrén 1857b: 23, 163ff.); according to Schönig (1998: 410), the Tuvan 
form for ‘who’ is qïm; some variants were excluded 


Tuvan Dzungar Tuvan Tofa Karagas 
who kim cəm kum kèm, kum 
what čüü dzy-dzime cu tü 
to do what éoon- 
why čü-ge, čü-den dzy-ge, dzy-nen qančža ‘how’ tü-gä, tü-dän, 

tüner usun 
how many čeže dzeze éehe, čü hure, teśe, tehe 
qaš 
how, which kandig candəy qandyy kandeg 
which kayi ~ kay(i)zi caj(ə)sə qajsy kaja 
where kayda cajda qajda kaida 
whither kayaa ~ kainar 
kaynaar 
whence kayiin ~ cajlap qajdan kajen 
kayiirtan 
when kazan cazan kasan, kahan 
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Table 5.146: Khakas (Anderson 1998: 21), Koibal (Khakas) (Castrén 1857b: 23, 
163ff.), Fuyu (Hu Zhenhua & Imart 1987: 31), Shor (Donidze 1997: 505), and Sarig 
Yughur interrogatives (Roos 2000: 87, modified transcription); Shor forms in 
brackets are from Nevskaja (2000: 294) 


Khakas Koibal Fuyu Shor Sarig 
who kem kem, kim gim kem k'im 
what nime nó, némá, nem noo ni 
nime 
why noya nódar, nogo ~ noo nayo, nati 
nô kerektan, 
nimedan, 
nóderga 
when xayzan kaden cajan [qaéan] qabtan 
how xaydi kaidi ~ kaidi niyor 
how much/ ninze, xanza nemde ninji nici, nicor 
many 
where xayda kaida cayda kajda, qhan 
[qayde] 
which, how xay, xayzi kaize, cayzi kaj(y) qaysi 
kaizeder 
whither xayya,xaydar kaidar [qayaya] qay-ta ~ 
qan-ta 
whence xayday kaidan qay-tan 
what kind of xayday kaidak cadah ~ nicik 
cadih 


Table 5.147: Middle Chulym (Li Yong-Sóng et al. 2008: 44; Anderson & Harrison 
2006; Harrison & Anderson 2003), Altai (Baskakov 1997: 183), and Chalkan 
interrogatives (Erdal et al. 2013: passim). Not all variants listed. 


Chulym (Li)  Chulym Altai Chalkan 
who kim kem kem 
what nóómá tfio ne t'u(u), t'ü-y ~ t'u-y/g, ne 
what kind of, how kazy qaydat, qayde, qaydeet, 
qayt(a) 
which, how qaydiy qajdury ` kand!  qandiy, qandu(y), qanduu 
why qaya qaja qay ‘how’ 
when qacan ~ qağan  ?qajya kaéan ` qaZan 
where qayda kajda qaya, qayda 
whither qaynar kajnaar qana(a), qayda 
whence qaydin kajdin 
how many/much ` kópà kanéa ` qanZa, qanZi, qanéa, 


qanéi 
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Table 5.148: Selected Yakut and Dolgan interrogatives (Stachowski & Menz 
1998: 423; Stachowski 1993: passim) 


Yakut Dolgan 
who kim ki(i)m 
what tuox tuok - tuogu 
why (-DAT) toyo togo ~ tuogo 
how much tóhó tóhó 
when (-DAT) yahan kahan ~ kagan 
how xaydax kajda(a)k ~ kajtak 
how much xas kas 
which xaya kaja 
where, whither yanna kanna 
whence xantan kantan 
along where kanan 
how much xyahya kahya(n) 
what kind of xyannik kannyk 


5.12 Uralic 


5.12.1 Classification of Uralic 


Leaving aside the possible existence of so-called Para-Uralic for which no direct evidence 
is available, Uralic may be classified as follows (Janhunen 2009: 65). 


(441) Uralic — Samoyedic 
Finno-Ugric — Mansic 
Finno-Khantic — Khantic 
Finno-Permic — Permic 
Finno-Volgaic — Mariic 
Finno-Saamic — Saamic 
Finno-Mordvinic — Mordvinic 
Finnic & Para-Finnic 


Uralic is usually divided into two main branches, Samoyedic and Finno-Ugric, the 
latter of which shows strong internal diversity and can be classified into about seven 
subbranches. However, only the Samoyedic branch (e.g., Janhunen 1977; 1998; Hajdú 
1988) will be treated here. Janhunen (1998: 459) mentions two possible classifications of 
Samoyedic languages which he calls the conventional and the alternative classification. 
Both classifications share the assumption that Enets and Nenets as well as Selkup and 
Kamass are relatively closely related, but differ in whether Nganasan and Mator should 
be granted a separate status or not. As for Enets and Nenets, the focus here will mostly lie 
on Tundra Nenets and Forest Enets, mostly excluding other dialects. A language called 
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Yurats probably was a transitional vernacular between Enents and Nenets and will be 
excluded for lack of data (Janhunen 1998: 457). 


5.12.2 Question marking in Uralic 


Marking strategies for polar questions have been surveyed by Miestamo (2011) for all of 
Uralic (Table 5.149). In general, Uralic languages form a relatively clear western border of 
Northeast Asia. Marking with initial, second position or preverbal particles, question af- 
fixes, and question word order are all features that set Uralic apart from other languages 
in Northeast Asia. Some of these features such as word order for marking polar questions 
rather have affinity with European languages, especially Germanic (§§4.2.1, 5.5.2.1). 


Table 5.149: Polar question marking strategies in Uralic (adapted from Mies- 
tamo 2011: 8); Int. = Intonation, IP = initial particle, PP = preverbal particle, 
FP - final particle, 2ndC - second position clitic, WO - word order, AnA - 


A-not-A 
Int. IP PP FP Clitic 2ndC WO Affix AnA 

Estonian + + + + 
Finnish + + 
Veps + + + 
Votic + 
Central-Southern Saami + + 
Northern Saami + + 
Skolt Saami + + + 
Mari + 
Erzya Mordvin + 
Moksha Mordvin + 
Komi-Zyrian + + + + 
Udmurt + + 
Hungarian + + + 
Khanty + 
Mansi + + 
Enets + + 
Kamass + 
Nenets + + 
Nganasan + + 
Selkup + 


As we will see in this section, not only the marking of polar questions but also the 
semantic scope of the question markers differentiates Samoyedic, especially northern 
Samoyedic, from most other languages in this study. 

The most complex system of asking questions can be found in Nganasan, which has 
recently been described by Miestamo (2011: 17). It is worth quoting his good summary in 
full length. 
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PIs are expressed by the interrogative mood or by intonation alone. The interroga- 
tive mood suffixes are different in different tense-aspect categories (they follow all 
other verb morphology but the person suffixes). In the present (aorist), the suffix 
is -nu/-na, and this suffix replaces the imperfective and perfective aspect suffixes 
used in the indicative present. However, the aspect suffixes mark aspect only re- 
dundantly (and only in the indicative present): the aspect distinction is a lexical 
one and imperfective and perfective verbs differ in their stems as well (except for 
a small number of biaspectual stems) — the semantic distinction is thus not lost 
in the interrogative. In the preterite, the interrogative suffix is -hu/-ha, and it re- 
places the preterite suffix used in the indicative. In the future expressed with -suta, 
the final vowel of the verb (the a of the future marker or the vowel of the person 
suffix) is lengthened if the verb is in final position in the interrogative. The inter- 
rogative iterative marker is -kaa, which differs from the indicative iterative -ka by 
the lengthening of the vowel. The interrogative future may also be expressed by 
-ntanu/-ntana, which is a combination of the progressive aspect suffix -nta and the 
present interrogative suffix -nu/-na; according to Larisa Leisiö (p.c.), the aorist and 
future would differ in the progressive interrogative in that the future would con- 
tain two instances of the progressive marker, but in actual usage, this repetition 
often does not happen and the distinction is then not made formally. The interrog- 
ative renarrative suffix is -ha instead of the indicative renarrative -hamhu, i.e. the 
second syllable of the marker is dropped in the interrogative. Other moods do not 
take interrogative suffixes, although some of them may be used in polar interrog- 
atives. The remote past and the future-in-the-past are used without interrogative 
marking in questions. The interrogative mood can also be used in content questions. 


(my boldface) 


The same markings are present not only in polar and content questions, but also in alter- 
native questions. The first two sentences are negative questions, present and iterative, 
showing that the question markers under negation attach to the so-called negative verb 
—a feature that Uralic shares with Tungusic (e.g., Hólzl 2015a)—rather than the lexical 
verb itself. Example (444) is an open alternative question in which the second of the two 
markers attaches to the interrogative verb. 


(442) Nganasan 

a. maaó toti ra-mta ni-ni-n helisi-’? 
why this partner-Acc.2sG NEG-PRS.Q-2sc help-con 
"Why don't you help this friend of yours?' 

b. tin nona-ntu’ ni-kaa-ri’ ^ houra-’? 
2PLself-2PL NEG-IT.Q-2PL fetch.wood-con 
‘Don’t you go to bring your firewood yourselves?’ 

c. ma-kali i-gu-ru' maa-gu-ru'? 
home-caAR be-PRS.Q-2PL what-PRS.Q-2PL 
"Are you homeless or what?’ (Gusev 2015a: 109, 121) 
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Question marking in Nganasan is markedly different from all other Uralic languages as 
well as from most other languages included in this study. Even from a global perspective, 
it qualifies as one of the more complex interrogative systems. Because of morphophono- 
logical alternations the exact form of the question markers is too complicated to be given 
here in full detail (see Helimski 1998: 489). As one can ascertain from the second part of 
the open alternative question, content questions also display the same question marking. 
When the interrogative has no verbal characteristics, the marking attaches to the verb 
of the clause. 


(443) Nganasan 
tin, maa anti-d’i ama tuj-yu-ru’? 
2PL what so-inf here come-PRS.Q-2PL 


"What have you come here for?’ (Gusev 2015a: 121) 


Polar questions in Forest Enets have final rising intonation, while in content ques- 
tions there is a peak on the interrogative (Siegl 2013: 353). Similar to Nganasan, there is 
a special past tense question marker that appears in both polar and content questions 
and combines with polar question intonation. Except for the past tense, questions remain 
unmarked morphosyntactically. No example for an alternative question was found, al- 
though the comparison with other Samoyedic languages suggests that they probably 
exhibit the double marking type. Forest Enets lacks indirect speech and thus has no in- 
direct questions (Siegl 2013: 198). 


(444) Enets (Forest) 
a. uu mosra-sa-d 
2sG work-PST.Q-2sG 
‘Did you work?’ (Siegl 2012: 402) 
b. kokod to-sa-d? 
whence come-PST.Q-2SG 


"Where did you come from?' (Siegl 2013: 355) 


The past tense interrogative suffix takes the forms -sa, -d'a, -t’a, or Gg, depending on 
the preceding word (Künnap 1999b: 27). Interestingly, while the answer to a past tense 
question of course must also be in the past tense, both the tense suffix as well as the 
position of the agreement marker differ from the question construction. 


(445) Enets (Forest) 
karaul-xud to-d-ud’. 
PN-ABL.SG come-1SG-PST 


‘I came from Karaul (Siegl 2012: 404) 


According to Siegl (2012: 403), this unusual situation of a tense suffix following an agree- 
ment marker is connected with the development of the question suffix. 
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In the Enets and Nenets languages, a new secondary past tense construction based 
on the finite verb and a free-standing auxiliary emerged. Later, the free-standing 
auxiliary merged with the finite verb, resulting in the unusual ordering where tense 
follows personal endings. Although the reasons for this unusual instance of change, 
as well as for the prior tense/aspect system preceding this change, await a more 
thorough investigation and reconstruction, the triggered change resulted in the 
emergence of a new mood which is only used in questions with general past tense 
reference. 


It may be worth noting that, typologically, the situation is similar to Nganasan. In both 
languages there is an integration of question marking and tense (or aspect). But there is 
only one marker in Enets, while there are several in Nganasan, and there is no formal 
identity of the respective markers. 

In Tundra Nenets there is a very similar situation to that in Forest Enets. Polar ques- 
tions display “pitch raising on the penultimate and ultimate syllables, which may make 
the sentence-final vowel longer? (Nikolaeva 2014: 267) Polar, content, and alternative 
questions exhibit the same past tense question marker -sa that has a palatalized dialectal 
variant -s'a and changes to -se before agreement markers (Nikolaeva 2014: 97). An s (s?) 
regularly changes into c (c^) following consonants (Nikolaeva 2014: 20). 


(446) Nenets (Tundra, Taymyr) 
nawor-ma-n? xarwa-da?? 
eat-N-DAT ` want-2PL 


"Do you want to eat?' (Mus 2015b: 90, from Nenyang) 
Interrogatives are either in situ or fronted (Nikolaeva 2014: 266). 


(447) Nenets (Tundra) 

a. xoqman-ca-n^? 
say.what-PST.Q-2sc 
"What did you say?' 

b. pidar’ ti-m xada-sa-r°? 
2sG  reindeer-Acc kill-PsT.o-2sG»sG.OBJ 
"Did you kill the reindeer?' 

c. wera to-sa, n 'i-sa? 
PN  come-PST.Q NEG-PST.Q 
‘Did Wera come or not?’ (Nikolaeva 2014: 95, 265, 267) 


Forest Nenets has the same question marker found in Tundra Nenets and Forest Enets 
and presumably exhibits the same semantic scope. Consider an example of a content 
question in the past tense. 
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(448) Nenets (Forest) 
kunana me-sa-n? 
where  cor-PsT.Q-2sG 


"Where were you?' (Mikola 2004: 115) 


Another way of forming a polar question usually addressed to oneself is the use of a 
dubitative enclitic. The enclitic can also be found in content questions and marks alter- 
native questions if used twice, and thus has the same semantic scope as the suffix. 


(449) Nenets (Tundra) 
a. n'abako-m'i ` tuut-ba-ta-w^h? 
elder.sister-186 come.FUT-COND-DUB 
‘(I wonder) will my sister come or not?’ 
b. s'axah tuut’a=wh? 
when come.fut-DUB 
"When on earth will he come?' 
c. tuku yaxa yor'a-w^h n'ii-w^h? 
this river deep-DUB NEG-DUB 


'Is this river deep or not?' (Nikolaeva 2014: 267, 268) 


The dubitative enclitic usually has the form =m*h but changes to =w‘h after vowels and 
to =(°)h after m. Examples (444) and (449c) of negative alternative questions exhibiting 
negative auxiliaries as second alternatives follow a construction very similar to several 
other languages in NEA. 

An interesting alternative question with a focus that is not on the verb is the following 
in which the verb takes the question suffix. The first alternative precedes and the second 
follows the verb. The second alternative has rising intonation towards the end. 


(450) Nenets (Tundra) 
noxa-m xada-sa-n^ | t’on’a-m? 
polar.fox-Acc kill-Psr.o-2sc fox-Acc 


‘Did you kill a polar fox or a red fox?’ 


As can be seen, there is only one question marker. Probably, this is the result of ellipsis 
of the originally reduplicated verb xada-sa-n*. The following example, which I reanalyze 
as an open alternative question, also has this structure. 


(451) Nenets (Tundra) 
ti-m xada-sa-n?, nan’i namke-m? 
reindeer-Acc kill-pst.Q-2sc other what-Acc 
‘Did you kill a reindeer or what (did you kill) instead?’ (Nikolaeva 2014: 268) 


Tundra Nenets has yet another clitic =t’iq ~ -d'iq absent in eastern dialects that may 
be found in questions but is not a question marker as such. 
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The interrogative clitic is used in questions, most typically, in rhetorical questions, 
but sometimes also information questions. Its function consists in strengthening 
the interrogative force, roughly in the same way as the ‘on earth’ expression in 
English (Nikolaeva 2014: 123) 


(452) Nenets (Tundra) 
an oh m'ina-dam-t'iq? 
whither go-1sG=EMPH.Q 
"Where on earth am I going?’ (Nikolaeva 2014: 123) 


There are typologically comparable emphatic elements in Chukchi and Yiddish ques- 
tions. 

For the extinct language Mator, only two content questions were recorded. Helimski 
(1997: 164) claims that both exhibit a suffix -s possibly related to the past tense question 
marker in Enets and Nenets. Given the fact, however, that both sentences were translated 
into the present tense, this seems rather unlikely. Mator has been extinct for over 150 
years, which is why more information cannot be obtained. 

Unlike all other languages in Northeast Asia, Selkup has a preverbal polar question 
particle derived from the interrogative qaj ‘what’ (Miestamo 2011: 18). 


(453) Selkup (Taz) 
tat qaj qan-na-nti? 
2SGQ  go-co-2sG 


‘Are you leaving?'(Wagner-Nagy 2015: 149, from Kuznecova) 


The interrogative qaj possibly has a Turkic origin (see §5.11.3). Content questions remain 
unmarked. Wagner-Nagy (2015: 142) is not clear whether final rising intonation affects 
only polar or also content questions. 


(454) Selkup (Taz) 
tat qum-it-ip | qajitqo assa apsti-s-al? 
2sG man-PL-ACC why NEG feed-Psr-2sc.O 
"Why did you not give the people any food?' (Wagner-Nagy 2015: 142, from 
Kuznecova) 


According to Castrén (1855: 111), alternative questions display the marker kai in front of 
each alternative (a feature shared with Ket, §5.13.2), and in negative alternative questions 
the second alternative has the form kai asa? ‘or not?’ (Le., qaj ‘what > Q’, asia ‘NEG’, 
Wagner-Nagy 2015). 

Miestamo (2011: 15) analyzes Kamass, extinct since 1989, as having an enclitic polar 
question marker =a. The marker attaches to the verb and does not appear in content 
questions. Alternative questions are marked twice with the marker -bV, like =a given 
with a hyphen but called particle by Künnap (1999b: 35f.). In line with Miestamo’s (2011) 
analysis, it is treated as an enclitic here. In addition, the example contains a disjunctive 
aali ‘or’, which comes from Russian (Joki 1944: 189). 
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(455) Kamass 


a. tan oron tarlééle malal=a? 
2sc hole.rAT skip.GER can.FUT=Q 


‘Can you skip the hole?’ 
b. Sinde dayin amna? 
who there sit.PRs.3sc 
"Who is sitting there?' 
c. man | amnoó'a-bo aalima’an ` Glo xallola'-bo? 
1sG.Loc live.FUT.2PL=Q or  home.GEN people Lat go.FUT.2PL-Q 
‘Are you going to live here at my place or are you going home to your own 
people?' (Künnap 1999a: 35, 36) 


The question marker =bV could have a Turkic origin (§5.11.2), but note that the extinct 
Kott language, according to Castrén (1858), has two question markers â and bo, both of 
which seem to have parallels in Kamass (§5.13.2). 


Table 5.150: Summary of question marking in Uralic 


PO CO AO 
Forest Enets V-sa ‘PST’ id. ?2x id. 
Kamass V-a - 2x =bV + aali ‘or’ 
Mator ? ?-s ? 
Nganasan V-gu/-ga ‘PRS’, V-hu/-ha id. 2x id. 


‘pst’, V-suto ‘FUT’, V-koo 
‘rT’, V-ha 'RENARR' 


Selkup qaj V ‘what>Q’ - 2x qaj V 
Tundra Nenets — V-sa ‘pst’, =w°h ‘DUB’ V-sa ‘PST? 2x V-sa ‘PST’, 2x =w°h 
‘DUB’ 


Table 5.150 summarizes marking of polar, content, and alternative questions. Little in- 
formation on tag or focus questions is available to me, but possibly there is a tag question 
marker in Nenets that has the form -xava ‘is it not so?’ (Miestamo 2011: 16f.). 

In general, northern and southern Samoyedic languages have quite distinct question 
marking strategies. The form and semantic scope of the northern Samoyedic markers 
set the languages apart from most other languages in Northeast Asia. Table 5.151 gives 
an overview of two of the question suffixes in northern Samoyedic and their cognates 
in southern Samoyedic. The Mator suffix -s- has tentatively been added, but its exact 
meaning remains unclear (Helimski 1997: 164). 

As shown in §5.11.2, the Southern Siberian Turkic language Khakas has a similar de- 
velopment from a past tense into a question marker (-3An) that seems to have been 
influenced by Samoyedic. Because of the large geographical distance, the Negidal future 
question marker presumably has no areal connection to Samoyedic (§5.10.2). 
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Table 5.151: Samoyedic tense markers based on Mikola (2004: 115f.) 


PS Nganasan Nenets Enets Mator Kamass Selkup 
PRET "-sà- | -sua-etc. -sa-Q -sa- Q -s- ?Q ? -s-, -h- 
AOR *-ņn(å)-  -gu-Q -1)(a)- mal: -ga -YV-, -8V- eon Ø 


5.12.3 Interrogatives in Uralic 


For reasons of space only limited aspects of interrogatives in Samoyedic can be presented 
here. The interested reader is referred to Mus (2009; 2013; 2015b) and references therein, 
who has given a very detailed description of Samoyedic interrogatives, especially those 
from the northern languages, and in particular those from Tundra Nenets. Unfortunately, 
her description lacks a clear historical or morphological analysis. 

All Samoyedic languages have a resonance in k~ (> x~ in Tundra Nenets), and thus 
have K-interrogatives. Only some languages have what is called a KIN-interrogative 
(e.g., Mator kim, Forest Nenets Kim o), Both features are inherited from Proto-Samoyedic. 
Janhunen (1977: 15, 62f., 69, 75, 91) reconstructs the following Proto-Samoyedic inter- 
rogatives *ki.m(3) ~ *kimä ‘who’, *ku- ‘what, which’, *ku.nà ‘where’, *kä- ‘what, how’, 
*ká.no ‘how much’ , *me ‘what’, and “a.m- ‘what’. Derivations in individual languages, 
the meaning of the stems, and whether the reconstructed forms are as clearly analyz- 
able as indicated by the hyphens, remain extremely unclear, however. The first three 
reconstructions share a resonance in *k- and thus are probably related historically. In 
several languages the initial consonant changed to a fricative in some forms such as For- 
est Enets sän, Tundra Nenets son" that are cognates of Nganasan kana and thus derive 
from *ká.nó ‘how much’. Generally, most interrogatives seen below can be grouped with 
one of these reconstructions. The initial n- in Tundra Nenets namke and Forest Nenets 
nami is prothetic (Janhunen 1998: 466) and the forms are thus derived from *a.m- ‘what’ 
(Janhunen 1977: 15). Note that the 7- only appears in the Central (namge) but not the 
Western (amge) and Eastern dialects (amge) of Tundra Nenets (Mus 2015a: 93). 

Let us now briefly consider the interrogative systems in individual Samoyedic lan- 
guages, starting with Nganasan (Table 5.152). There is only one resonance in k- and only 
the categories of PERSON ("k-) and THING have special forms without this resonance. 
The interrogative maa-d'aa ‘why’ is derived from maa ‘what’ with the help of what ap- 
pears to be an allative. A form syly/cyazy ‘who’, borrowed from Nganasan, is attested for 
Taimyr Pidgin Russian (§5.5.3.3). 

In Forest Enets there is also a resonance in k~. The interrogatives obu ‘what’, Ze ‘who’, 
and sán ‘how much’ do not exhibit this submorpheme, although the latter two histori- 
cally had an initial *k as well. 

The interrogatives meaning ‘where’, ‘whither’, and ‘whence’ have separate forms, al- 
though the first two share a stem ku-, while the last is based on ko-. Instead of obu ‘what’, 
Tundra Enets has the interrogative mi’ (Künnap 1999a: 5) or mii’ (Castrén 1855: 97). For- 
est Enets exhibits an interesting interrogative with the meaning ‘which of two’ that has 
its own paradigms shown in Table 5.154 (e.g., koki-ju? ‘who of us two’). 
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Table 5.152: Nganasan interrogatives (Helimski 1998: 500f.; Kortt & Siméenko 
1985: passim; Castrén 1855: 47, 49, 50, 65, 74); not all variants listed, accents 


removed 

Meaning Helimski Kortt & Siméenko Castrén 

who sili(-yuna) syly sele 

what maa(-nuna) ma’ maa 

why (?-ALL) maa-diaa ma-d'a maajaar) 

when kagge kanga kanay 

how many kano kano’ kana’, kanay 

“the how manieth" ^ kano-mtu(?) kangkoj kanamtua, 
kanagüi' 

how many times kan-ü? kani’ 

which, what kind of ` kua, kuniia, kunie, kurodi, kunie, kurajee 

kuredi kuredi 


where 
whence 
along where 
whither 


how 


kunu, kunJini 


kunlid 
kunji, kunda 


kunji-?ila 


kuninu 
kunida 
kunimenu 
kuni’ 


id. 


kuninu 
kunida 
kunimanu 
kuni’aan, 
kunijaar 


id. 


Table 5.153: Forest Enets (Siegl 2013: 195ff.; Künnap 1999a: 5, 22, 27, 30, 40) and 
Enets interrogatives (Castrén 1855: 76, 81, 82, 90, 91); not all variants listed 


who 

how much 
what 

why (TRSL) 
whence 


where 
whither 
along where 
how 

when 


which, what kind of 


Siegl 
$e 
sàn 
obu 
obu-$ 


ko-ko-d 


ku-ni-n 
ku-? 


kun 
kuna 
kursi 


Kiinnap 


seea 
obu, abbua 
kuhod 


kunne 
kuo’ 


kun’, kud’, ku’on 


kurse 


Castrén 
sio, sie 
senno 
awuo 


kuro, kudo, 
koohoro 
kokohone 
kuu 
kuuno’one 


kuuno’ kurahaane 


kun(n)e 


hooke 
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Table 5.154: Paradigms of the Forest Enets interrogative ‘which of two’ (Siegl 


2013: 198) 
DU PL 
1 koki-ju? =1DU 
koki-ri? koki-ra? 
koki-di? koki-du? 


Apart from other Samoyedic languages (e.g., Tundra Nenets xujum? ‘which of two’, 
Mus 2015b: 79), this interrogative has no functional parallel in Northeast Asia, but in 


Proto-Indo-European *k"oteros (85.5.3). 


Nenets interrogatives exhibit two resonances, one in k- or x-, and another in s’~ or 
$-. Initial *k- regularly changed to x- in Tundra Nenets, but remained stable in Forest 
Nenets (Hajdú 1988: 4). The initial s`- or š- likewise goes back to *k- (cf. Janhunen 1977: 
62f.). As mentioned before, the initial y- is prothetic (Janhunen 1998: 466). 


Table 5.155: Tundra Nenets (Nikolaeva 2014: 50, 265, passim), Forest Nenets 
(Mus 2013: passim), and Nenets interrogatives according to Castrén (1855: 3, 
10, 32, 327); the Tundra Nenets forms in square brackets are from Mus (2013; 


2015b); not all variants listed 


Tundra Nenets Forest Nenets 


Castrén 


who xiib'a kim'a 

which xa-n’a-ni’ ku-na-ni 

where xo-n'a-na ku(-fa)-na 
whither xo-n'a-h ku(-na)-y 
whence xo-n'a-d* ku(-na)-t 

along where [xo-n'a-mna] 

how xe(n)c'er'q kuSe?, kušeł 

to say what xoqman- 

what (kind of) ` xurka 

how many San" šan 

when s'ax'h Saxan, Sajna, Sana 
what size [s'agkar] Sam an 

what (kind of) ` gyomke yami 

why yamke yame?, naminas 


hübea, hibea etc. 
hu-naa-gy 
hu-naa-na 

hu-naa 

hu-naa-d 
hu-na-mna 

hunder, hunzier etc. 
?ha-maan 

hurk(k)a 

sarooka, sambir 
?saha' 

?sarjum, saguna ‘how long’ 
(n)amge(e) 


(gy)amge(e)jemne 


The form meaning ‘when’ is derived from ‘how many’. Tundra Nenets has only one 
form, whereas Forest Nenets makes a distinction into different forms for ‘what (kind 
of)’ and ‘why’. The interrogative namke has the irregular accusative plural nawo (Niko- 
laeva 2014: 25) and exhibits a function similar to xurka. The two forms are sometimes 


interchangeable. 
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(456) Nenets (Tundra) 
xurka/yomke l’ekara-ne° tara-sa? 
which doctor-Ess needed-PsT.Q 


‘What kind of doctor did he work as?’ (Nikolaeva 2014: 261) 
Locative demonstratives and interrogatives in Forest Enets show partly parallel para- 


digms with special morphological markers -n ‘Loc’, -? ‘ALL’, and -å ‘ABr’ that are other- 
wise only known from postpositions (Table 5.156). 


Table 5.156: Demonstrative and interrogative paradigms in Forest Enets (Siegl 
2013: 197, 204); modified analysis 


PROX DIST INT 
LAT au(-?) to-ni-? ku-? 
LOC aku(-xu)-n to-ni-n ku-ni-n 
ABL aku(-xu)-d to-ni-d ko-ko-d 


While all three stems share the same case markers, there are differences in the for- 
mation of the stems that are only insufficiently understood. Siegl (2013: 204) admits that 
the "spatial deixis system of Forest Enets is far from being clear". However, a comparison 
with Tundra Nenets sheds some light on the situation. 

In Tundra Nenets the suffix mai (~ -(x)° ~ -y^) in the selective interrogative xa-n’a-ni 
^is an attributive form (Nikolaeva 2014: 52). The locative usually has the form can Jo, 
the dative has the 2nd and 3rd person possessive form -xah-, and the ablative has the 
form -xad° (Nikolaeva 2014: 62ff.). Apparently, these forms contain an element -xə that 
is missing in the locative interrogatives that simply add the case markers -na ‘Loc’, -h 
‘DAT’, and -d° ‘ABL’, but attach to an element -n’a instead that has been translated as 
‘at, by’ (Nikolaeva 2014: 50). The prolative, found in xa-n’a-mna, usually has the slightly 
different form -man(’)a(h). Apart from the locative forms listed in Table 5.155, Nikolaeva 
(2014: 50) mentions the shorter forms xu-na, xu-h, xu-d^, and xu-mna. This variation can 
also be seen in Forest Nenets, e.g. ku(-ria)-na ‘where’. Forest Enets shows a less clear 
picture, but it can be noted that both the case markers (-n, -?, -d) and stem formations (- 
ni, -xu, -ko) have parallels in Tundra Nenets (-na, -h, -d°, and aa -xa, -ko). The last of the 
suffixes can perhaps be found in Tundra Nenets demonstratives such as e.g., t'uko-xo-na 
‘there’, which seems to correspond to Forest Enets to-ni-n but has different derivations. 
The comparison with Nganasan in Table 5.157 illustrates basically the same pattern. 
Helimski (1998) recorded synchronic variation in Nganasan with (ku-n/i-ni) and without 
the stem extension (ku-nu) as well. 

The Tundra Nenets interrogative stem xa-, mistakenly called an “interrogative pre- 
fix” by Wagner-Nagy (2016: 3204f.), fused with the negative verb n'i-, resulting in the 
complex form xan’a- ‘how not’ (Nikolaeva 2014: 281). The interrogatives xiib’a ‘(to be) 
who’ and namke ‘(to be) what’ may be either verbal or nominal without requiring any 
derivation (see 85.4.3 on Yupik and §5.10.3 on Tungusic). 
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Table 5.157: Paradigms of the locative interrogative in Nganasan and Tundra 


Nenets 
Nganasan Tundra Nenets 
where ku-ni-nu xo-n'a-na xu-na 
whither ku-ni- xo-n'a-h xu-h 
whence ku-ni-da xo-n'a-d? xu-d* 
along where ku-ni-menu xo-n'a-mna xu-mna 


(457) Nenets (Tundra) 


a. (pidar?) xiib 'a-n?-s'?? 
2sG who-2sG-PST.Q 
"Who were you?’ 


b. xiib’a-h teda sawa? 
who-GEN reindeer.3sc good 


"Whose reindeer is good?' (Nikolaeva 2014: 257, 251) 


Full paradigms are not attested but see Mus (2009; 2015b) for a partial list of forms. 

In Tundra Nenets there is an interrogative xaqman- with the meaning ‘to say what’, 
with the verb man- ‘to say’ as a second element (see 447 above). Given the special mean- 
ing, one cannot exclude an areal connection to Kolyma Yukaghir monosod- with the 
same meaning that exhibits the verb mon- ‘to say’ as a first part (§5.14.3). The verb for 
‘to say’ was already similar in the respective proto-languages (Nikolaeva 2006: 274), but 
the mere existence of an interrogative with this specific meaning in NEA is extremely 
rare and might indicate a contact phenomenon. 

The extinct language Mator had a resonance in k~ (e.g., kim ‘who’, kumna ‘how many’, 
kulgu ‘which’, kagan when") and at least one form, amgan ‘why’ (Helimski 1997: passim), 
without it that might be connected with Tundra Nenets (n)amge ‘what’. As in Nganasan, 
Enets, and Nenets, the locative forms seem all to be built on a stem ku-, but no stem 
extension can be found, e.g. ku-na ‘where’, kuna ‘whither’, kuj ‘whence’. Mator kulgu 
‘which’ could correspond to Tundra Nenets xurka. 

The Selkup interrogative system (Table 5.158) exhibits two resonances in k- and q-. 
The form kuti ~ qod seems to have replaced the original form meaning ‘who’. The inter- 
rogatives qaj, and kaindek (and less likely kussan ~ quzan) seem to derive from a Turkic 
source (§5.11.3). According to Castrén (1855: 111), Selkup also has an interrogative kak ~ 
kan ‘how’ that was borrowed from Russian kak/xax. 

Kamass (Table 5.159) has two resonances in g~ and k~, both of which derive from *k-. 
The initial š- in the interrogative meaning ‘who’ goes back to *k- as well (Janhunen 1977: 
69). The individual forms remain largely obscure synchronically. 

In sum, the interrogative systems in Samoyedic display a bewildering diversity of 
forms that in this study is only overcome by Indo-European and Trans-Himalayan. No 
interrogative has been fully preserved in all Samoyedic languages, many exhibit idiosyn- 
chratic derivations, and only a few forms have a relatively wide distribution (e.g., "ku.nàá 
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Table 5.158: Selkup interrogatives from different dialects (Wagner-Nagy 2015: 
152, passim; Castrén 1855: 111, 113, 126); not all variants listed 


Taz dialect Ob dialect Castrén 
what qaj qaj kai 
why qajqo kaitko, kaino 
who kuti qod kud, kod 
when kuššan ~ kuššat quzan kussai, kusal, kunzei 
where kun - kut kun, kut't'an kun, kaigan 
whither kuééa qu, kuéet ku, kaind 
whence kuuni kut’aut, quute kun, kaigan 
how kaindek 
how much kaana 


Table 5.159: Kamass interrogatives (Künnap 1999b: 19, 26, 28; Castrén 1855: 179, 
180, 181, 183, 184; cf. Joki 1944: 145) 


Kiinnap Castrén 
what (a)mbi(i) ümbi 
why, what for ombiile, moo’, mo ümbi ila’, imbi naaman 
who Sindi, Sandi, Sinds, $mdo simdi 
what (else) Sombi 
which, who (of the two)  gi'i gidi, kidi 
which, what kind giga 
where (to) gijen gidigan 
where to giibar gidibir, gidre 
where from gii gidiga’ 
how kado', kədə’ kada’ 
which, what kind kajet, kajet kadet ‘how’ 
when kaamon kaaman 
how many/much kümen, gilzi khümán 
which kümeeggit khiimanit ‘the how manieth’ 


‘where’), which either indicates strong language contacts or, what is more likely, per- 
haps a longer time of separation than the usually accepted 2000 years (e.g., Janhunen 
2009). In comparison, Tungusic, which is estimated to be of more or less the same age 
(e.g., Janhunen 2005), presents a much more coherent picture with many forms found 
throughout the entire family (§5.10.3). For this reason, the above discussion was not able 
to give an adequate overview of historical developments, which only an expert in these 
languages can provide. 
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5.13 Yeniseic 


5.13.1 Classification of Yeniseic 


As we have seen in Chapter 3, the Yeniseic language family differs strongly from most 
other languages in NEA (e.g., Comrie 1981: 61-66; 2003; Anderson 2003; 2006b; Georg 
2008). Today, Ket is the only representative of this language family, but historically there 
have been more languages, including Yugh (extinct since the 1970s), Kott (extinct since 
1850), Assan (extinct since 1800), Arin (extinct since the 1730s), and Pumpokol (extinct 
since the early 1800s) (Vajda 2009a: 470). Several other languages may have existed but 
these are almost entirely unknown. This chapter will thus be focusing primarily on Ket, 
but where possible comparative data will be included from other languages as well, es- 
pecially Yugh and Kott. There are several attempts at a classification. Georg (2008: 153) 
proposes the following: 


r— Ket 
-— Northern — Yenisei-Ostyakic + 
L tYugh 
UU a. ?'Pumpokol r- !Assan 
r- Assanic 4 
— Southern + L— "Kott 
L— Arin 


According to Vajda (2009a: 470), Arin can perhaps be classified together with Pumpo- 
kol. Both approaches agree in the number of languages as well as in a close relation of 
Ket and Yugh on the one hand and of Kott and Assan on the other. While Georg classifies 
Arin with Assan and Kott, Vajda tentatively assumes a connection with Pumpokol. Both 
approaches are well aware of the somewhat unclear position of Pumpokol. For lack of 
sufficient information this chapter will exclude Assan, Arin, and Pumpokol. In addition, 
Vovin et al. (2016), and references therein, have, in my eyes, conclusively shown that at 
least parts of the Xiongnu confedertation in what today is northern China and Mongolia 
must have spoken a Yeniseic or Para-Yeniseic language (cf. Shimunek et al. 2015), which 
indicates that, historically, (Pre-)Proto-Yeniseic must have been located much further to 
the east. 


5.13.2 Question marking in Yeniseic 


Questions in Yeniseic languages have been analyzed by Werner (1995: 155-168), who 
based his approach on V. A. Moskovoj. Unfortunately, his account is rather obscure and 
lacks a proper analysis of the examples. Where possible, the analysis in this subsection 
follows Vajda (2004) and Georg (2007). 

Polar questions in Ket may take a marker -u that usually takes the second position 
in a sentence, which is a marked difference from most other languages of NEA. Werner 
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(1995: 159) claims that =u is a particle, but wrote it attached to other words with a hyphen. 
It is reanalyzed as enclitic here. 


(459) Ket 
to'k-u etam? 
axe=Q sharp 


‘Is the axe sharp?’ (Werner 1995: 159) 


Another polar question marker tam also converts interrogatives into indefinites, e.g. 
tam bíla 'somehow' and functions as a disjunction if employed twice. 


(460) Ket 
bū tam d[u]-i[k]-n-bes? 
3sGQ  3w-here-PsT-move 
‘Did he come?’ (Kotorova & Nefedov 2015: 67) 


In negative polar questions the enclitic =u attaches to the negator ban’ that in this 
case has the unexplained form ban’d. The d might be epenthetic. 


(461) Ket 
tok ban’du etam? 
axe NEG.Q sharp 


‘Is the axe not sharp?’ (Werner 1995: 159) 


Note that the enclitic does not take second position here, perhaps because the negator 
and question marker were reanalyzed as one element. Possibly, the form has to be reana- 
lyzed as ban -du in which the second part might be the unexpected third person singular 
masculine predicative marker (Stefan Georg p.c. 2016). However, both =u and ban du are 
said to highlight the following instead of the preceding word (Kotorova & Nefedov 2015: 
66). 

For alternative questions Ket has borrowed the Russian disjunction ili/umu, used in 
interrogative and non-interrogative contexts, but also makes use of double marking with 
the negative polar question marker put before each alternative. 


(462) Ket (Madujka, Kurejka) 
a. t-a-[i]n-gij, tület on-am usam ili qómat-am? 
PREV-THEM-PST-say berry many-PR.3sG EX or few-PR.3sc 
'Say, are the berries many or few?' 
b. ána qa-du, bóndu hik.biseb, bóndu qím.biseb? 
who big-Pn.3sc.M Q brother o sister 


"Who is bigger, the brother or the sister?' (Kotorova & Nefedov 2015: 122, 
183)°° 


36 Many thanks to Stefan Georg (p.c. 2016) for finding these examples and helping with some aspects of their 
analysis. 
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In example (462b) an alternative question follows a content question (§4.4). Clearly, 
the morphosyntactic behavior of the question marker qaj in the Uralic language Selkup 
that appears once before each alternative in alternative questions has an areal connection 
to Ket bindu (§5.12.2). But while the Selkup marker seems to derive from an interrogative 
of perhaps Turkic origin (see §5.13.3), this is not the case in Ket. In another example only 
one marker is present between the two alternatives. In one case a negative alternative 
question seems to exhibit juxtaposition. 


(463) Ket (Kellog) 
a. uk hu?n hánuna-da bandu uk hi?p hónuna-du? 
2sG.GEN daughter small-PR.3F Q 2sG.GEN son small-PnR.3M 
‘Is your daughter or your son smaller?’ 
b. u bín-ku kénas-ke?t, bín-ku ban? 
2sG self-2sc bright-human self-2sc NEG 


"Are you a real (lit. bright) person or not?' (Kotorova & Nefedov 2015: 199, 
228) 


Content questions in Ket are generally unmarked. Interrogatives may be incorporated 
and thus defocused. Under “object focus" the interrogative áküs ‘what’ takes the form aj 
and under "subject focus" an (Vajda 2004: 88). 


(464) Ket 
a. áküs dó-b-bét? 
what 3r.S-3N.O-do 
‘Just what is she making?’ 
b. da-áküs-[s]-bet? 
3r-what-Ms-do 
"What is she doing? 
c. aj  dá-b-bet? 
what 3r.S-3N.O-do 
‘Just what is it she is making?’ 
d. an ku-[i]n-a? 
what 2S-pst-active.event 


‘Just what happened to you?’ (Vajda 2004: 88) 


Both an and aj are sometimes called question particles (Werner 1995: 156; Kotorova & 
Nefedov 2015: 66), which clearly must be rejected. For Yugh there seems to be the same 
mistake (Werner 1997b: 214). 


(465) Yugh 
an din’e? 
what 1S.pstT.active.event 


‘Just what happened to me?’ (Werner 1997b: 225) 
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Perhaps, an has the same function as in Ket as there are also other forms such as assa 
‘what’ that may also be incorporated and is thus comparable with Ket akus ‘what’. His- 
torically, Yugh assa may go back to *aksa (Werner 2004: 157), which makes it even more 
similar to the Ket interrogative. A cognate of aj does not seem to be attested. 


(466) Yugh 


a. u assa ku-b-bet’? 
2sc what 2S-3N.O-do 


"What are you doing?' 
b. u k-assa-i-get’? 
2sg 2S-what-do 
‘What are you doing?’ (Werner 1997b: 225, 226) 


Apart from an, Werner (1997b: 214) claims that there are several more question mark- 
ers, namely yala ~ yara, atá, and bən’. The status of the first could not be settled, but 
ata most likely is simply an interrogative meaning ‘why’ (§5.13.3), while ban’ is a nega- 
tor. Werner translates the following sentence with ‘or not’, which seems comparable to 
example (463b) from Ket above. 


(467) Yugh 
dilatkat bil’l’a donand’in, ban’? 
children all — ?3.PsT.3.come NEG 


‘Have the children all come or not?’ (Werner 1997b: 225) 


For marking polar questions, Yugh had in addition an unspecified intonation pattern 
(Werner 1997b: 225). 

Even less information than for Yugh questions is available for Kott. But apparently, 
Kott has a final question marker that is analyzed as enclitic here. 


(468) Kott 
hus=bo? 
horse=Q 
‘(Is it) a horse?’ (Castrén 1858: 153, Werner 1997a: 80) 


For lack of further examples the semantic scope of =bo remains unclear. Alternative 
questions seem to take two markers (A=bo B- bo), although no example was provided by 
Castrén (1858: 153). Most likely, =bo, like the marker =bV in the Uralic language Kamass 
(85.12.2), has been borrowed from a Turkic source (§5.11.2). Castrén (1858: 154) further- 
more mentions the Kott question marker 4. There is no information on its morphosyntac- 
tic behavior or exact function, but it might be connected with the Kamass polar question 
marker =a. Apparently, Russian Jm has also been borrowed. There is no example for 
a content question from Kott. 


37A connection with a Mongolian question tag (eg. Dukhan hala ~ haran) seems too far-fetched (85.8.2). 
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Table 5.160: Summary of question marking in Yeniseic 


PO CQ AQ 
Ket #A=u, £A bóndu, £A tam - (bándu) A bándu B, ili ‘or’ 
Yugh - - ? 
Kott =bo#, ?à ? 2x =bo# 


5.13.3 Interrogatives in Yeniseic 


The Yeniseic interrogatives strongly differ from those found in other languages of North- 
east Asia. Especially the large amount of forms meaning ‘who’ and ‘what’ is exceptional. 
Ket additionally has analyzable forms such as ásés bi’ ‘what kind of thing’ and ásés ke’t 
‘what kind of person’ (Vajda 2004: 32). The existence of special female and male forms 
of the personal interrogatives is unique but has some typological parallels in the Indo- 
European selective interrogatives (85.5.3). Ket and Yugh interrogatives usually start with 
a~ or with b~, which has no clear parallels in NEA, but in Burushaski, for example (Yosh- 
ioka 2012). It may be remembered that this is first and foremost a typological classifica- 
tion and does not necessarily indicate a genetic connection. The interrogative systems in 
Ket and Yugh are certainly similar to each other and show some direct cognates (e.g., Ket 
bisén, Yugh bisa’:n ‘where’) and identical categories (e.g., ‘who.sc.F’ vs. ‘who.sG.m’). But 
there are several striking differences (marked with boldface) that suggest a considerable 
time of divergence. 


Table 5.161: Ket (Vajda 2004: 31, 41f., 88) and Yugh interrogatives (Werner 1997b: 
10, 98f., 103, 211, 214, 226); the Ket forms in square brackets are from Georg 


(2007: 167) 
Meaning Ket Yugh 
who.sG ana ~ ánét aneit, anet ‘who.sG.m’ 
who.PL anét-an, bílàgsan asein, ase:n 
what ak(u)s, an, aj assa, an 
how many anun, [bilon] an'ej(a), birejo":n, biron 
which, what kind of ásés, as aseis, as'eis'(i) 
when áskà aëke 
why aksdint ata, asesan 
who.sG.F bésà asera 
who.sG.M bítsé aneit, anet 
where bisér bisab:1n 
whither biltan, [biles] bire!s, birch 3 
whence bilil, [bili(ga)l] biri:r, bira:r 
how bila, [bilunon] birej 
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According to Georg (2007: 165), the form bílà-r-s-an *who.Pr' can be analyzed as 'how- 
PL-N-PL’ with an unexpected plural marker -an, but a development from ‘how’ to ‘who’ is 
extremely unlikely. According to Vajda (2013: 89), Ket bi-l-es and Yugh bi-r-e":5 ‘whither’ 
can historically be analyzed as ‘which-poss-open.space’. Diachronically, the actual stem 
thus may not be bil- (Georg 2007: 167), but bi-. 


Table 5.162: Paradigm of the Ket locative interrogative (Georg 2007: 167) 


SG PL 

1 biser-di, biser-am (N) biser-day 
biser-ku biser-kar 
biser-du (m), bisern-da (F) biser-ar 


Interestingly, Werner (1997b: 226) also mentions the Yugh forms bi-da ‘where is it/ 
she?’ and bi-du ‘where is he?’ that seem to show a gender contrast. This is comparable 
with the Ket forms bisen-da and bisen-du (Table 5.162) that are based on an extended 
stem (cf. Yugh bisa^:r ‘where’). Perhaps, Ket bilon ‘how many’ is based on the European 
pattern (e.g., Russian kak mnogo/xax wnoro), cf. Ket bílà 'how' and òn ‘many, much’. Note 
that Yugh, apart from birən, has a more transparent form birejo":n. Ket áküs ‘what’ has 
an abbreviated variant ak(u)s that “must be quite old and stabilized, since the retention 
of phonetic [k] in the longer variant can only be understood as a remodelling [sic] after 
the former.’ (Georg 2007: 82, fn. 92) This is the basis for aks-dint(a) ‘why’, which exhibits 
an adessive marker (Georg 2007: 166). 

For Kott there is an extensive description by Castrén (1858) that has been elaborated 
on by Werner (1997a). The Kott interrogative system (Table 5.163) also has a resonance 
in b~ but only one form starting with a- and also has the form helem ‘when’ as well as 
Sena or sina ‘what’ that deviate from this pattern and are perhaps unrelated to the other 
forms. They do not appear to have been borrowed from any known language. 

Reduplication expresses indefinite meaning, e.g. bili bili ‘somewhere, everywhere’ 
(Castrén 1858: 150). The complex interrogative sena ójar ‘why’ is a transparent combi- 
nation of Zeng ‘what’ and ójar) ‘because, for’ (Castrén 1858: 202). Following Vajda (2013: 
88), one may identify a stem bi- such as in bi-l-tun, cognate of Ket bí-I-tàn ‘whither’, 
that goes back to Proto-Yeniseic *wi-l-təñ ‘which-poss-path’. The exact analysis of most 
other forms remains uncertain to me as Ket, Kott, and Yugh have a tendency for opaque 
interrogative systems. Their historical analysis can only be accomplished by an expert 
of Yeniseic languages. 

Non-selective Interrogative pronouns in Yeniseic have extensive paradigms of case 
marking (Table 5.164, see Werner 1997b: 98 for Yugh; Werner 1997a: 79f. for Kott). Demon- 
stratives show related paradigms (e.g., Werner 1997b: 97, 103). 

In sum, Yeniseic interrogative systems deviate strongly from those in all other lan- 
guages in Northeast Asia. Apart from the formal differences—there are neither KIN- nor 
K-interrogatives—there are unusual categories such as a gender distinction in the per- 
sonal interrogatives, incorporation, and a large number of different interrogatives with 
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Table 5.163: Kott interrogatives (Castrén 1858: 55, 149ff.) 


Meaning Form 

which, who asix, PL aSig-an 
where bili 

whither biltur 

whence biléan 

how many/much bilipei, bilipéi 
what kind of bilár, PL bilág-an 
which bilituise 

what Sena, Sina, no PL 
why Sena djan/und/undjan 
when helem 


the meaning ‘who’ and ‘what’. 


Table 5.164: Ket singular interrogative paradigms (Werner 1997c: 140) 


Meaning who.masc  who.FEM what who.MAsc who.FEM 

ABS an’a an'a akus' bit'se bes'a 

GEN an'a-da an'a-d(i) akus’-t bit'se-da bes'a-d(i) 

DAT an’a-dana  an’a-dina akus’-tina bit'se-daga bes’a-dina 

BEN an’a-data an’a-dita akus’-tita bit’se-data bes’a-dita 

ABL an'a-dagal an’a-dinal’ akus’-tinal’  bit'se-dagal bes’a-dinal’ 

LOC - - akus’-ka - - 

PROS an’a-bes’ an’a-bes’ akus’-bes’ ` bit'se-bes' bes’a-bes’ 

ADES an a-danta — an'a-digta akus’-tinta  bit’se-danta bes’a-dinta 

COM an'a-(yas'  an'a-(yjas  akus'-as' bit'se-yas, bes'a-yas, 
bits'as' bes'as 

CAR an'a(yan ` an'a-(yan  akus'-an bit'se-yan, bes'a-yan, 
bits'an' bes'an 


5.13.4 Dene-Yeniseian? 


If the Dene-Yeniseian hypothesis (Vajda 2010) has a basis in actual fact, the common 
proto-language must be several thousand years older than Proto-Yeniseic. It is unlikely 
that question markers remain stable over such long time spans. Similarities would be 
expected instead in the interrogative system. There have been previous attempts to cor- 
relate Yeniseic and Na-Dene interrogatives, notably by Werner (2004: 157ff.), but, in the 
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absence of clear cognates and sound laws, any comparison must be preliminary at best. 
For reasons of space and lack of reliable reconstructions, Na-Dene interrogatives can- 
not be dealt with here. Nevertheless, it is a possibility that the type of question marking 
that is somewhat less prone to changes than the actual question markers shows certain 
similarities. Given that the Ket question marker is very different from the surrounding 
languages—Selkup was most likely influenced by Ket—chances are high that it repre- 
sents a relatively old and possibly stable feature. Na-Dene consists of Eyak, Tlingit, and 
the Athabaskan languages. Of course, only a very cursory overview can be given here. 
According to Enrico (2004: 267), Na-Dene languages have a tendency for “clause-initial 
clitic interrogative markers". Perhaps, what Enrico has in mind are sentence initial ques- 
tion markers such as in Slavey (see also 84.2). 


(469) Slavey (Mountain) 
hígolo —fehk'é? 
Q moose 3.shot 
‘Did he shoot a moose?’ (Rice 1989: 1123) 


In Western Apache there are both sentence initial and final question markers that may 
be used independently of each other or combined. 


(470) Apache (Western, San Carlos) 
(ya’) Katie nldzil (né)? 
Q PN  strongQo 
‘Is Katie strong?’ (de Reuse 2006: 57) 


However, Eyak, Tlingit, and Navajo all show second position clitics as does Ket. 


(471) Eyak 

sAsinL=sh? 

?die=Q 

‘Did it die?’ (Krauss forthcoming: 648) 
(472) Tlingit 

lingit=gé x'eeya.áxch? 

PN=Q 2sc.understand.it 

"Do you speak Tlingit?' (Cable 2007: 74, fn. 40) 
(473) Navajo 

dichin-ísh nílj ? 

hunger=Q 2sc.Poss 

"Are you hungry?' (Young & Morgan 1987: 23) 


A difference from Ket is the presence of an overt second position question marker in 
Eyak, Tlingit, and Navajo content questions as well. 
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(474) Eyak 
[dee k'u.tse']-d? 
what meat=Q 


"What meat?' (Krauss forthcoming: 586) 


(475) Tlingit 
daa-sá uwajée | wutoo.o0wü? 
what-Q 3pL.think 1Pr.bought.it 


"What did they think we bought?' (Cable 2007: 69) 


(476) Navajo 
haa-sh yidzaa? 
what-Q 3Acc.3NoM.happened 
"What happened to him?' (Fountain 2008: 33) 


In Navajo, the polar (=ish ~ =sh) and content question markers (=shq’ ~ =sh) partly over- 
lap in form (Young & Morgan 1987: 23). However, some languages such as Slavey and 
Western Apache have unmarked content questions just like Ket. 


(477) Slavey (Hare) 
menj ‘ahet’e? 
who 3.cop 
"Who is it?’ (Rice 1989: 1141) 


(478) Apache (Western, San Carlos) 
hadín dazhé nldzil? 
who very strong 
"Who is very strong?' (de Reuse 2006: 50) 


Independent of the question of whether Yeniseic and Na-Dene are genetically related 
—which cannot, of course, be proven by typology—, Na-Dene shows markedly different 
question marking than most of NEA, except Ket, parts of Chukotko-Kamchatkan, and 
some Indo-European languages. Thus, there is a relatively clear boundary between NEA 
and North America. Furthermore, Dryer (20131) has shown that polar question marking 
in the Americas in general is much less uniform than that in NEA. The frequent sentence 
initial position of interrogatives likewise differentiates Na-Dene and the Americas in 
general from NEA (Dryer 2013k). 
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5.14 Yukaghiric 


5.14.1 Classification of Yukaghiric 


Today, there are two surviving but endangered, or rather moribund, Yukaghiric lan- 
guages called Tundra Yukaghir (Wadul) and Kolyma Yukaghir (Odul). Two varieties 
called Chuvan and Omok are usually included in the list of Yukaghiric languages. Both 
have already disappeared and are not well recorded (e.g., Anderson 2006e). 


r- Tundra (Wadul) 
— Kolyma (Odul) 


(479) 
——— +Chuvan 


— —  TOmok 


According to Nikolaeva (2008), however, "the linguistic status of Chuvan and Omok did 
not much differ from the status of other varieties of Old Yukaghiric, and therefore re- 
ferring to them as separate languages within the larger family to the exclusion of other 
known Yukaghir idoms is unnecessary". Old Yukaghiric is a cover term used by Niko- 
laeva (2008) for those varieties recorded during the 18^ and 19 centuries. Given the 
limited information on languages other than Tundra and Kolyma Yukaghir this chapter 
will be concerned primarily with these two modern languages. 


5.14.2 Question marking in Yukaghiric 


Kolyma Yukaghir makes a difference between polar and content questions in that only 
the latter take morphological marking. Polar questions are either expressed with rising 
intonation or take an enclitic=duu that appears twice in alternative questions as well 
as in negative alternative questions (Nagasaki 2011: 245; Maslova 2003a: 475-478). 'Ihe 
semantic difference between polar questions marked with intonation or the enclitic is 
unknown to me. 


(480) Yukaghir (Kolyma) 

a. omo-s' Soromo-k? 
good-ATTR person-PRED 
‘Is (he) a good person?’ 

b. me-n’oho-j=duu? 
PRED.FOC-fall-INTR.3SG=Q 
‘Did he fall down?’ 

c. igeje CiCegej-gen! ^ ad-i-duu Sasaqa-daj-m-duu? 
rope strech-1MP.3sG strong-INTR.3SG=Q tear-CAUS-TR.S=Q 


'Let the rope stretch! Is it strong (enough), or will he tear it up?' 
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d. kudede-duu oj-l'e-duu? 
kill=Q NEG-be=Q 


‘Have I killed it or not?’ (Maslova 2003a: 475-477) 


The enclitic also exists in the Turkic languages Yakut and Dolgan, where it has the form 
=duo ~ =duu (§5.11.2). The enclitic does not exist in Tundra Yukaghir (Nikolaeva 2006: 
150), which also suggests a Turkic origin. 

Schiefner (1871) published some material of a variety spoken along the Anadyr that is 
closely related to Kolyma Yukaghir but possibly has affiliations with Chuvan (Nikolaeva 
2006: 28). This variety does not appear to exhibit the enclitic. Instead, polar questions re- 
main unmarked and (negative) alternative questions have a disjunction of Russian origin 
(Nikolaeva 2006: 101). The tentative analysis roughly follows Maslova (2003a). 


(481) Yukaghir (Kolyma, Anadyr) 
mot adó kêt’ alí el=két’? 
1sG son come.INTR.3SG Or NEG=COMe.INTR.3SG 


‘Did my son come or not?’ (Schiefner 1871: 92) 


This absence of the enclitic in a variety of Yukaghir spoken further away from Yakut and 
Dolgan is a further indication that it can be traced back to Turkic. Nagasaki (2011: 254) 
recorded a tag question that was formed with the help of Russian da/ya ‘yes’ attached 
to a declarative sentence. 

According to Maslova (2003b: 66f.) polar questions in Tundra Yukaghir are formed 
with the help of the apparently sentence initial particle eld’e. But the particle also appears 
in content questions. 


(482) Yukaghir (Tundra) 
a. eld'e, tide-n mit t'ald'ed'uo el» men -me-k? 
well DEM.INV-FOC 1PL ring NEG=take-TR-2sG 
"Well, haven't you taken that ring of ours?’ 
b. eld’e, neme-le men’-me-k? 
well what-Foc take-TR-FC 


"Well, what have you bought?’ (Maslova 2001: 48, 42) 


A sentence initial question marker indicates a connection with Chukotko-Kamchatkan 
(§5.3.2). However, as the translation indicates, the word eld’e is probably not a real 
question particle. Neither the exact meaning, nor its origin are discussed by Maslova. 
Nikolaeva (2006: 154f.) assumes an underlying stem “el- that could mean something like 
‘good’, apparently unrelated to the negator el= as seen in (482a). 

Maslova (2003b: 66f.) mentions two further particles, the dubitative quolem (formally 
similar to interrogatives starting with quo~) and hesitative ejk. Furthermore, she claims, 
“if these particles are absent, the verb takes the Negative marker”. However, on the same 
page she gives an example of what appears to be a polar or focus question that neither 
shows the particles, nor negation. 
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(483) Yukaghir (Tundra) 
tet-ek — Id'ilwej? 
2SG-FOC PN 
‘Are you Idilway?' (Maslova 2003b: 67) 


Data given by Schmalz (2012) confirms the hypothesis that unmarked polar questions 
do not have any of the above mentioned particles. Consider the following examples with 
focus marking on a constituent and on the verb, respectively. 


(484) Yukaghir (Tundra) 
a. tet-ek werwe-l? 
2sG-Foc be.strong-N.SFOC 
‘Are you strong?’ 
b. (mórde(n)) me=méri-mk? 
news PRED.FOC-hear-TR.2PL 


‘Have you heard (the news)?’ (Schmalz 2012: 69, 71) 


Presumably, polar questions can be indicated with intonation only, as is also possi- 
ble in Kolyma Yukaghir. This suggests a connection with some Chukotko-Kamchatkan 
languages (85.3.2). 

The proclitic me= seen in (484b) can also be found in questions with a denominal verb 
"to ask for mere confirmation of already known information" (Schmalz 2012: 88). This 
indicates a functional similarity to tag questions in English. 


(485) Yukaghir (Tundra) 
me-brigadir-ro-d'ek? 
PRED.FOC=team.leader-be-INTR.2SG 
‘(You) are the team leader, (aren't you)?’ (Schmalz 2012: 88) 


Alternative questions also differ from Kolyma Yukaghir in that Tundra Yukaghir uses 
a disjunctive connective ejk, identical to the alleged hesitative ejk. 


(486) Yukaghir (Tundra) 

uo purie-le ejk samnaldann’e-le aptaa-nu-m? 

child berry-Acc or mushroom-Acc gather.INCH-DUR-TR.3SG 

‘Is the child picking berries or mushrooms?’ (Schmalz 2012: 83) 
It is difficult to decide from the limited data whether ejk has to be analyzed as a dis- 
junction or as a single question marker, which is a possible marking pattern in some 


languages. Schmalz presents one instance of yet another possible disjunction, uuri, of 
unknown origin (Nikolaeva 2006: 445). 


(487) Yukaghir (Tundra) 
tet ile me-éaal'-uon' uuri me-n'aawe-j. 
2sG reindeer PRED.FOC=be.bay-INTR.3SG or — PRED.FOC-be.white-INTR.3sG 
‘Is your reindeer bay or white?’ (Schmalz 2012: 88) 
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Yukaghiric content questions are more complicated than other question types and 
involve morphological marking on the verb. In Kolyma Yukaghir there is a split between 
three different paradigms. Special interrogative marking is the default choice, except 
for so-called intransitive subjects (better called S) and direct objects (better called O), in 
which case focus marking is employed (Nagasaki 2011: 245). This distribution has certain 
ergative characteristics, but within focus marking two paradigms exist for transitive (so- 
called me-participle, Tables 5.168 and 5.169 below) and intransitive verbs (J-participle, 
Tables 5.166 and 5.167 below) (Nagasaki 2011: 240). A questioned A (transitive subject) 
requires no focus or agreement marking. 


(488) Yukaghir (Kolyma) 


a. tet qanin kelu-k? 
2sc when come-2sc.Q 


"When do you come?' (questioned peripheral argument) 


b. kin-tek kelu-l? 


who-PRED.FOC come-PTCP 
"Who came?’ (questioned S) 
c. tet lem-dik 003e-t-mo? 
2sG what-PRED.FOC drink-ruTr-PTCP.2sG 
"What will you drink?' (questioned O) 


d. kin kudede? 
who kill 


‘Who killed (it)?' (questioned A) (Nagasaki 2011: 245, 240) 


In Kolyma Yukaghir, interrogatives either stand sentence-initially or remain in situ 
(Maslova 2003a: 481). Sentence-initial position of interrogatives in NEA is rare, but can 
also be found in Evenki (§5.10.3). Note the additional predicative focus marker -(le)k 
(which has a special form on these two interrogatives) that is included in the case par- 
adigm by Maslova (1997: 459f, 2003a: 88). It appears on nominal predicates as well as 
on intransitive subjects (S) and direct objects (O) and is thus not restricted to questions 
(Nagasaki 2011: 227). According to Maslova (1997: 459) it is zero marked on “third person 
pronouns, proper nouns, and possessive NPs”. 

Basically the same pattern of content question marking was already in place in the 19th 
century, as can be seen from the following sentences given by Schiefner (1871) for the 
variety already encountered above. Again, the tentative analysis tries to follow Maslova 
(2003a). 


(489) Yukaghir (Kolyma, Anadyr) 


a. kanin kawe-i-ta-je-k? 
when go-?Prv-FUT-INTR-2sG.Q 


"When will you leave?’ (questioned peripheral argument) 
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b. kin-ak kallu-l ta? 
who-PRED.FOC come-PTCP there 
"Who came over there?' (questioned S) 
c. mit lomdak aa-ta-m? 
1PL what-PRED.FOC do-FUT-PTCP.1PL 
"What will we do?’ (questioned Oy?? 
d. kin ólo? 
who steal 


"Who stole (it)?' (questioned A) (Schiefner 1871: 101, 92) 


The focused interrogative kin-ak seems be closer to Tundra kin-ek than to Kolyma kin- 
tek. However, this could also be an artifact of the recording. 

Content questions in Tundra Yukaghir are better understood than polar questions and 
exhibit a close affinity to those in Kolyma Yukaghir. There are verbal suffixes that "are 
only used in specific [i.e. content] questions to peripheral constituents" (Maslova 2003b: 
20). Matié's (2014) summary of how content questions are marked can be seen in Ta- 
ble 5.165. Marking of content questions is thus basically identical to Kolyma Yukaghir. 


Table 5.165: Content questions in Tundra Yukaghir (Matić 2014: 132, modified) 


S/O A Oblique 
agreement S/O focus - interrogative 
marking on interrogative focus case - - 
position of interrogative preverbal or sentence initial position 


(490) Yukaghir (Tundra) 


a. neme-lek joora-t-ook? 
what-1NsT play-FUT-1PL.Q 
"What are we going to play with?' (questioned peripheral argument) 
b. kin-ek | ewra-l? 
who-roc come-PTCP 
"Who has arrived?' (questioned S) 
c. neme-l  feed'i-t-man? 
what-roc tell-Foc-rrc».2sc 
"What will you tell (us)?' (questioned O) 
d. tet-qana kin tite gitríar wee? 
2sG-Acc who so up.to do 
"Who did that to you?’ (questioned A) (Matić 2014: 131f.) 
38Note that Kolyma Yukaghir has ‘Isc’ -me but ‘1pt’ -l (Table 5.169). 
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The focus marker has the form bin) ~ -(a)k (Matić 2014: 131). Maslova (2003b): 8, 52) 
gives the form as -le(n) ~ -(e)k and again includes it in the case paradigm. According to 
Schmalz (2012: 55), -le(n) usually attaches to nouns and -(e)k to pronouns. Interestingly, 
kin(-ek) ‘who’ thus behaves like pronouns and neme(-le) ‘what’ like nouns, which is a 
common cross-linguistic pattern (§4.3). The variant -len tends to be a focus marker and 
-le an accusative (Maslova 2003b: 54). The obligatory combination of focus markers with 
certain verb forms has a typological parallel in Japonic, where a similar phenomenon 
is called kakari musubi (85.6.1). Tables 5.168 and 5.169 exclude paradigms for marking 
of A (transitive subject) as they have almost no special marking; see (488d), (489d), and 
(490d). In Tundra Yukaghir the third person pronouns take the forms tud and titt. The 
verb furthermore remains unmarked except for third person plural -nu (Schmalz 2012: 
56). 


Table 5.166: Focus marking in intransitive clauses in Tundra Yukaghir (Schmalz 
2012: 56); uu(l)- ‘to go’ 


Verb Focus Subject Focus (S) 
1sG met mer=uu-je-1) met-ek uu-l 
2sc tet mer=uu-je-k tet-ek — uu-l 
3sc tudel mer=uu-j tudel | uu 
1PL mit mer=uu-je-li mit-ek ` uu-l 
2PL tit mer=uu-je-mut tit-ek uu-l 
3PL tittel ` mer-uu-pi titte.l uu-gu-l 


Table 5.167: Focus marking in intransitive clauses in Kolyma Yukaghir (Maslova 
20032: 140, 144, 234; Nagasaki 2011: 230); Sohie ‘get lost, disappear’, amde- ‘to 
die'; constructed in analogy to Table 5.166 


Verb Focus Subject Focus (S) 
1sc met m=amde-je-O met-ek  &ohie-l 
2SG tet m=amde-je-k tet-ek ` &ohie-l 
3sc tudel  m-amde-j tudel ^ &ohie-l 
1PL mit m-amde-j-l'i mit-ek ^ &ohie-l 
2PL tit m-amde-j-met tit-ek Sohie-l 
3PL titte.] m=amde-ni titte.] Sohie-ni-l 


The special interrogative verb endings from both languages are collected in Tables 
5.170 and 5.171, comparing them with the declarative endings. The suffixes -m(e) and -je 
that sometimes appear in front of the agreement markers express transitivity and intran- 
sitivity, respectively (Maslova 2003a: 141). For the most part, the paradigms in Tundra 


9?This suffix takes the form -mle if following the future marker -te. 
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Table 5.168: Focus marking in transitive clauses in Tundra Yukaghir (Schmalz 
2012: 56); ai- ‘to shoot’ 


Verb Focus Object Focus (O) 
1sc met mer-ai-r met ai-men 
2SG tet mer=ai-mek tet ai-mer) 
3sc tude mer=ai-m tudel  ai-mele 
1PL mit mer=ai-j mit ai-l 
2PL tit mer-ai-mk tit ai-mk 
3PL tittel | mer-ai-ga tittel ai-yu-mle 


Table 5.169: Focus marking in transitive clauses in Kolyma Yukaghir (Maslova 
20032: 140, 144; Nagasaki 2011: 221, 230); jug- ‘to see, to look at’, aa- ‘to make’; 
constructed in analogy to Table 5.168 


Verb Focus (FUT) Object Focus (O) 
1sG met aa-t-O met jue-me 
2sc tet aa-te-mek tet jue-me 
3sc tudel  aa-te-m tudel  juo-mele?? 
1PL mit aa-te-j mit juø-l 
2PL tit aa-te-met tit juø-met 
3PL tittel  aa-pi-te-m tittel  jue-pi-le 


and Kolyma Yukaghir are extremely similar or even identical. One difference is the pres- 
ence of a first person singular agreement marker -ņ in Tundra Yukaghir that is absent 
in Kolyma Yukaghir. The same difference can be observed in the transitive verb focus 
paradigms (Tables 5.168, 5.169). Furthermore, Tundra Yukaghir has a special second plu- 
ral ending -mk in the transitive paradigm instead of the expected -mut (also compare 
Tables 5.168, 5.169). 

There is the possibility that interrogative agreement forms in Negidal—most unusual 
for a Tungusic language—may be traced back to Yukaghiric influence (§5.10.2). Similar 
to both Kolyma and Tundra Yukaghir, Negidal has special agreement forms for the first 
person singular -m as well as the plural (inclusive) -p, and the third person plural remains 
unmarked. The formal similarity in the singular is accidental, but the typological parallel 
is unlikely to be due to chance. However, Negidal has the same marking throughout all 
question types and combines this with other question markers. 


5.14.3 Interrogatives in Yukaghiric 


Nikolaeva (2006) reconstructed several Proto-Yukaghiric interrogatives. The form “kin 
‘who’ is very similar to forms with the same meaning in several surrounding languages 
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Table 5.170: Tundra Yukaghir non-future endings (Maslova 2003b: 18); for the 
interrogative only future endings are available, showing the additional future 


suffix -t(e) 

TR NONFUT ITR NONFUT Q FUT 
1sc 0 -je-y -te-m 
2SG -me-k -je-k -te-k 
3sc -m-Ø -j-Ø -t-Ø 
1PL -j -je-l'i -t-uok 
2PL -mk -je-mut -te-mut 
3PL -ņa (FUT -gu-te-m) -yi-Ø (FUT -gu-te-j) -pu-t-Ó 


Table 5.171: Kolyma Yukaghir non-future endings according to Maslova (2003a: 
140); alternations of j not shown here include d' and é (Maslova 2003a: 43); 
alternative forms in square brackets according to Nagasaki (2011: 228f.) 


TR NONFUT ITR NONFUT Q NONFUT 


1sc -Ø -je-Ø -m 

2sc -me-k/[-mi-k] -je-k -k 

3sc -m-© -j-O/[-O] -Ø 

1PL -j -je-li [-j(ii)-li] -]-ook/[-uek] 
2PL -met -je-met [-j(e)-met] -met 

3PL -gaa/[-gam] (FUT -gi-te-m)  -gi-O (FUT -gi-te-j) -ni-O 


(the so-called KIN-interrogatives, Chapters 3 and 6). The interrogatives *qa- ‘which’ and 
*qo- (> quo- in Tundra Yukaghir) ‘where’ must be related, historically. They suggest a 
connection between the two categories of SELECTION and PLACE, the latter usually being 
derived from the former. However, as is often the case, a reconstruction of clear-cut inter- 
rogative stems is rather questionable. More generally, Yukaghiric exhibits the common 
K~ resonance present in many languages of the area (Chapters 3 and 6). Proto-Yukaghir ? 
*leme ‘what’ may have started with an “n instead of an "/ (Kolyma Yukaghir leme ~ neme, 
Tundra Yukaghir neme) as did *nogoon ‘what for’. Table 5.172 gives a more exhaustive 
list of forms from the two extant Yukaghiric languages. Most forms start with a q~, only 
a few with n-~ (~ l~) and kin ‘who’ has a special position in both languages. Interestingly, 
the functional distribution of the resonances k-, n-, q- is almost identical to Turkic lan- 
guages (§5.11.3). In contrast to what Nikolaeva's (2006) reconstructions suggest, the two 
Yukaghiric languages share several very specific interrogatives that can be traced back 
directly to the proto-language. 

A difference can be found in the locative interrogatives, i.e. Kolyma qon versus Tun- 
dra qadaa ‘where’. Additionally, while in Tundra Yukaghir case markers attach directly 
to the locative interrogative, the case marker replaces the final -n in Kolyma Yukaghir. 
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Table 5.172: Interrogatives in Kolyma (Nagasaki 2011: 245; Maslova 2003a: 238, 
250) and Tundra Yukaghir (Maslova 2003b: 41) 


Meaning Kolyma Tundra 
Nagasaki Maslova Maslova 

who kin kin kin 

what leme leme ~ neme neme 

what for nooyon noron 

by what numun 

to say what monoxod- 

which qadi qadi 

what (verb modifier) qadinol 

how many/much qamun qamun qabu-n ~ qabu-d 

where qon qo-n qadaa 

to be where qol-l'e- 

whence qot qo-t qadaa-t 

whither qanide qa-ride 

along what route qadaa-n 

how qodo - qode qodo quode 

to be how qodo-l'e- quode-ban- 

at what place qadun qadoon- 

when qanin qanin qan'in 

why qodit qodi-t quodii 

how often qamlid'e qaml'id'e 

to be how qodimie- qodimie 

to be how many/much ` qamloo- qamlal 


Schmalz (2013: 186, 208), in his otherwise excellent grammar of Tundra Yukaghir, ana- 
lyzes the initial q- as the analyzable interrogative stem for the interrogatives in Yukaghir, 


which might be too far-fetched. The resonance in q-, of course, could indicate an origi- 


nal etymological connection, but similarities with demonstratives are perhaps better an- 
alyzed as the result of an additional resonance phenomenon or paradigmatic analogy to 
the demonstratives (e.g., Diessel 2003, Bickel & Nichols 2007). Schmalz (2013: 186, 208) 
also mentions some additional interrogatives for Tundra Yukaghir that have not been 
listed above, such as quodeband'e ‘what kind of’ that he analyzes as quode ‘how’, pan- 
‘to be’, and the participle -je etc. 
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6 Interrogative constructions in 
Northeast Asia: A summary 


Chapter 5 presented a very detailed description of questions in NEA based on a clas- 
sification into language families. This chapter has an areal and typological perspective 
instead. Unfortunately, the information found for almost all languages is insufficient for 
an exhaustive typology. Usually, only the elicitation from a native speaker, the existence 
of a large and modern grammar book or of a specialized description of questions offer 
enough information. Not only is there insufficient information on individual question 
types and on the semantic scope of markers and interrogatives, but most descriptions 
also lack adequate information on intonation. This summary follows the same structure 
as the previous discussion. §6.1 gives an overview of question marking and §6.2 of inter- 
rogatives. A set of 12 maps in the style of the World Atlas of Language Structures (WALS), 
based on a sample of 83 languages for which sufficient information was available, is pre- 
sented in §6.4. However, except for Figure 6.5 (Dryer 20131,j), there is no equivalent for 
them in the WALS. §6.3 evaluates the significance of the grammar of questions with an 
emphasis on language contact. 


6.1 Question marking 


6.1.1 Marking strategies 


Chapter 4 introduced a four-way typology based on the markedness a comparison of 
declarative sentences with polar questions. Table 6.1 and the following discussion is 
based on the sample of 83 languages. The majority of languages belongs to type 4 while 
types 1 and 2 are not attested. Type 3 is found in Central Siberian Yupik, Korean, Jeju, 
and perhaps Nganasan, all of which are located in peripheral regions. Tundra Nenets 
and Forest Enets show a mixture of types 3 and 4 and were thus excluded. 


Table 6.1: Marking of polar questions versus declarative sentences in NEA 


unmarked PQ marked PQ 


marked declarative Type 1: 0 Type 3: 4 
unmarked declarative Type 2: 0 Type 4: 77 


6 Interrogative constructions in Northeast Asia: A summary 


Details of the marking of polar questions are given in Figure 6.5. Altogether 45 out of 
83 languages (about 54%) have a sentence-final marker as the major question marking 
strategy. Deviations can mostly be found in peripheral regions such as Amdo, Korea, the 
Ryükyüan Islands, Chukotka, and the lower Yenisei. In comparison, only 314 (about 36%) 
of Dryer’s (20131) global sample of 884 languages had a sentence-final question marker 
(particle or clitic). If one considers all the languages that have sentence-final question 
markers, including those with additional marking strategies (Figure 6.6), the figure rises 
to 62 (about 75%) languages out of 83. This speaks in favor of an extremely strong areal 
feature of NEA. Dryer’s (20131) map indicates that adjacent areas to the west and south- 
west indeed show less sentence-final question markers. However, there is no such clear 
boundary with MSEA in the southeast. sentence-final question particles are generally 
more common in verb-final languages such as in NEA, but are also common in SVO lan- 
guages such as in MSEA (Dryer 2013a: 274, 277). Concerning this feature, Dryer (20131: 
Chapter Text) discovered “an area within Asia including mainland Southeast Asia and 
extending west into India and north through China to Japan and eastern Siberia”. This 
study has demonstrated that almost all of NEA shares the feature as well. There is a clear 
area around Amdo that extends towards the south and encompasses Trans-Himalayan 
languages from several subbranches; it is characterized by verbal affixes (§5.9.2.2). This 
forms a clear boundary towards the south (see also Dryer's (2013j)). A marked difference 
also exists between NEA and North America (§5.13.4). 

The marking of content questions is shown in Figure 6.7. Altogether, 41 (about 49%) 
out of 83 languages have morphosyntactically unmarked content questions. As opposed 
to polar questions only 15 languages (about 17%) have a sentence-final particle or clitic 
exclusively, but 13 (about 16%) have a morphosyntactic marker. However, by counting all 
languages that have sentence-final markers or affixes among other strategies, the figures 
rise to 27 (ca. 33%) and 18 (ca. 22%), respectively. The first is restricted to the middle part of 
NEA, stretching from Japan in the east to Xinjiang in the west. Regarding the latter, there 
are two possible areas: (1) Koreanic and northern Ryükyüan in the southeast, (2) Yupik, 
parts of Samoyedic, Yukaghiric, and perhaps Negidal (but not Turkic), in the north. 

Information for alternative questions is unavailable for 34 out of 83 languages (Fig- 
ure 6.8). Of the remaining 49 languages, 21 (about 43%) exhibit the double marking 
type, exclusively. These are mostly located in the northern half, but excluding Chukotka 
and Kamchatka. Of the 19 languages with a mixed type, only Plautdiitsch, Yiddish, and 
Urumqi Han Chinese do not have double marking as one of several marking strategies. 
The remaining 16 languages are mostly located in the southern half. In sum, 37 (about 
76%) out of the 49 languages, at least as one possibility, exhibit the double marking strat- 
egy. Selkup and Ket share a unique double marking strategy in which the respective 
markers appear before each alternative. In all other languages the markers follow the 
alternatives. This indicates areal convergence of Ket and Selkup as well as a special po- 
sition of Ket among the languages of NEA (83.5). Altogether, 17 (about 35%) out of the 
49 Janguages contain a disjunction, which may or may not be accompanied by other 
question markers (Figure 6.9). These are mostly located in the southern half of NEA, 
including Korean, Mongolian, Chinese, Russian, Uyghur, Kazakh, and surrounding lan- 
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guages. Focusing only on those languages that have single marking in alternative ques- 
tions (Figure 6.10), there are indications for two areas, (1) a clear area in Amdo with 
possible connections to Xinjiang (Uyghur) and areas to the south, and (2) Yiddish and 
Ukrainian that share an areal background in Eastern Europe. 

For most languages no relevant data was available for focus questions, which is why 
no map was created. There is sufficient information to conclude that focus questions 
in Japonic languages as well as Mandarin tend to contain both a focus and a question 
marker, while in Tungusic, Amuric, and Aleut as well as Old Japanese and Middle Mongol 
the same question marker as in polar questions is employed, which usually attaches to 
the verb in polar questions and to the element in focus in focus questions. 

Morphosyntactic question markers tend to be extremely short, usually just two or 
three phonemes long. This indicates their grammatical function as well as relatively 
high frequency. From genetic and areal perspectives the brevity of the forms represents 
both an obstacle and a possible pitfall: the shorter a given form, the more likely are 
chance resemblances. In fact, it is easy to find identical question markers in languages 
from around the world. But in most cases geographical distance and a lack of interaction 
clearly show that these similarities must be due to chance, e.g. Amdo Tibetan =na, Sibe 
=na, and Ura =na. Question markers usually have a simple form that may be represented 
as (C)V(V), i.e. they consist of a minimum of one vowel phoneme (e.g., Amdo Tibetan 
ə-v) that is optionally preceded by a consonant. In some instances there is a long vowel 
or a diphthong (e.g., Hateruma -naa, Khalkha -(y)UU, Ulcha -noo, Yakut and Dolgan 
-duo ~ =duu, Kolyma Yukaghir =duu). Note that almost all question markers, indepen- 
dent of their morphological status, share this pattern. Only in some rare cases are there 
question markers that do not conform to this generalization (e.g., Ket bóndu, Tundra 
Nenets -w'h, Nganasan V-suta, Alutor matka, Koryak met ke, Xunke Oroqen jooma). In 
most cases it may be surmised that the question marker is of a relatively young age and 
will be subject to phonetic erosion during future developments. In some languages the 
marker may consist of one consonant only. However, while it is true that in Uyghur, for 
instance, the marker may have the form -m, it still preserves the more conservative vari- 
ant =mu as well. In short, the pattern (C)V(V), although not universal, is an extremely 
strong tendency for question markers in NEA and perhaps worldwide. 

Disjunctions tend to be longer and follow no clear phonotactic pattern (e.g., Atkan 
Aleut asxuunulax, Xunke Oroqen aaki, Russian ili, Plautdiitsch óuda, Yiddish odor, Man- 
darin háishi, Kazakh ælde, Sarig Yughur ta" qi, Tundra Yukaghir ejk, uuri). Question tags 
can well be very short (e.g., English eh?, German ge? etc.), but are usually considerably 
longer and less homogenous (e.g., Mandarin dui ma?, dui-bu-dui?, Russian ne pravda li?, 
German nicht wahr?, richtig?, English don't you?, right?, Plautdiitsch es nich zóu?, Sarikoli 
na sou-d-o? etc.). This proves the special position of disjunctions and question tags with 
respect to the domain of question marking. 
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6.1.2 Semantic scope 


A comparison of polar questions with content questions reveals that 61 (about 73%) out 
of the 83 languages have different marking strategies (Figure 6.11). In a global sample 
of 50 languages, Hólzl (2015c) found a comparable figure of 73%. There is evidence for 
one clear area including Japanese, Koreanic, Ainuic, Written Manchu (not shown on the 
map), Kilen, Ulcha (not shown on the map), Dagur, Khorchin (not shown on the map), 
and Ogami, which has the same marking in polar and content questions. These languages 
furthermore also tend to mark focus and alternative questions in the same way. When 
looking at only those languages in which polar and content questions are overtly marked 
differently (Figure 6.12), there are three clear areas: (1) parts of Ryükyüan, (2) parts of 
Mongolic and Turkic, as well as perhaps Kolyma Yukaghir, Middle Korean (not shown 
on the map), and Gyeongsang Korean (not shown on the map), as well as (3) Amuric 
and Uilta. These results clearly prove Levinson's (2012a: 13) rather dubious implicational 
universal wrong: “For all languages that have clear interrogative markers, they mark yes- 
no questions (or polar questions) differently from Wh-questions (or content questions)? 
A similar implicational universal by Siemund (2001: 1019)—“if a language uses a particle 
to mark constituent interrogatives, then this language will also allow the use of this 
particle in polar interrogatives"—had already been disproved by Hólzl (2015c; 2016b: 23). 

The comparison of the semantic scope of polar and alternative questions is severely 
hampered by several problems. For instance, Kazakh as spoken in China has a question 
marker -MA that appears in both polar and alternative questions. However, the latter 
additionally exhibit a disjunction aelde. If the disjunction is seen as a question marker, 
then polar questions in Kazakh are marked differently. If, on the other hand, disjunctions 
are seen as a different functional domain that in some languages combines with question 
marking, the question marker of polar and alternative questions is the same. This study 
decided for the latter alternative. However, Kazakh, like many other languages of NEA 
employs two of the polar question markers in alternative questions. Again, we face two 
mutually exclusive possibilities, but this time no clear solution to the problem is available. 
For 34 languages no information is available for alternative questions. Of the remaining 
49 languages, when excluding disjunctions and neglecting the difference between single 
and double marking, 37 (ca. 76%) exhibit the same marking as in polar questions. Seven 
languages exhibit a mixed type and only five languages (ca. 10%) exhibit different polar 
and alternative question marking (see Figure 6.8 and Figure 6.13). 

A comparison of the semantic maps of all the question marking systems with the help 
of the conceptual space is possible for only a handful of languages. For reasons of space, 
Figure 6.1 only shows a selection of four languages. These and the data above suggest 
that there is a strong dividing line between content questions and polar questions, which 
in turn show affinities with both focus and alternative questions. 84.2.2 introduced a pos- 
sible universal that is shown with dashed lines between content, focus, and alternative 
questions (Content questions are only marked in the same way as focus or alternative 
questions if polar questions are also marked in the same way.) 

The only possible exception to this rule found in NEA is the Ryükyüan language Mi- 
yara from the Japonic language family (Davis & Lau 2015). In this language, if compared 
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2xV-MOOD asxuunulax ‘or’ 


V-MooD (+ #FOC/-(n)un TOP) V-MOOD 


2x-l(ay/-lo háishi ‘or’ 


roc -l(a)/-lo -ņa, -ai(a) ma# + shi roc 


Figure 6.1: Semantic scope of question markers in Korean (top left), Atkan Aleut 
(top right), Nivkh (bottom left), and Mandarin (bottom right) 


with the declarative sentence, both focus as well as content questions lack the indica- 
tive marker. But as further specified in §5.6.2, content questions, like declaratives, have 
falling intonation while polar and focus questions share rising intonation. That the in- 
dicative marker is missing results from the fact that both types of questions, content and 
focus, share a focus marker that is incompatible with the indicative. This is a subtype of 
the phenomenon usually called kakari musubi (focus concord). In the end, Miyara thus 
most likely presents no exception to the universal. The second universal (Focus and alter- 
native questions can only be marked in the same way if polar questions are also marked 
in the same way.) seems to hold for Northeast Asia as well. Of course, both universals 
(or tendencies) can be unified into one form: Focus, alternative, and content questions 
can only be marked in the same way if polar questions are also marked in the same way. 


6.1.3 Interaction of functional domains 


84.2.3 identified the following possible interactions of functional domains (see also Hólzl 
2016b: 24): (1) grammaticalization, (2) combination, (3) fusion, (4) interaction (split types). 
Tables Table 6.2 to Table 6.5 give a list of all instances of these interactions in NEA. 
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Table 6.2: Grammaticalization of question markers in NEA (1) 


Language Type Form Source Domain 
Alutor PQ #matka ?INT 
Koryak PQ #met’ke ?INT 
Ukrainian PQ #ey INT ‘how’ 
Tocharian A ?PQ assi adposition + INT 
Sogdian PQ (a)éu-t(i) INT ‘what-comp’ 
AQ kataar(-ati) INT ‘which-comp’ 
German TO was, wie INT ‘what’, ‘how’ 
TQ oder OR 
Selkup PQ qaj V INT ‘what’ 
Japanese PQ tte QUOT 
PQ no# NMLZ < ?GEN 
Ainu PQ ruwe#, hawe#, siri# NMLZ 
Ogami CO -e£ ?NMLZ 
PQ =tu# ?FOC 
CQ =ka# ?FOC 
Khorchin PQ fii NEG 
Mongolic CQ Khalkha be#, Buryat be# ~ — cor 
=b, Khamnigan Mongol 
bei#, Oirat =w ~ =b, Shira 
Yughur bə# 
Turkic CQ Salar V-i, Tuvan V-Il, Tofa DEM > ?coP 
V-(u)l, Dolgan V-ij, Yakut 
V-(n)ij 
PQ e.g., Turkish =mI NEG > Q 
Mandarin (dialects) PQ ma# ?NEG (NAQ) 
PQ bu#, mei# NEG (NAQ) 
Amdo Tibetan PQ =na ?NEG (NAQ) 
Nganasan PQ, CQ, AQ PRs V-nu/-na, PST TAME 
V-hu/-ha, FUT V-sutə, IT 
V-kəə, RENARR V-ha 
Chukchi, Kerek CQ IMP-V IMP 
Forest Enets PQ, CQ, ? PST V-sa tense 
Tundra Nenets PQ, CQ, AQ PST V-sa tense 
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The origin of most question markers is obscure. Several somewhat unclear cases dis- 
cussed in Chapter 5 such as the Ryükyüan (=na(a) and variants) or the Yakut and Dol- 
gan question markers (=duo ~ =duu) that could be related to interrogatives meaning 
‘what’ were omitted. If Ogami =ka and =tu indeed derive from focus markers, this is 
most likely also true for several other Ryükyüan languages (e.g., Shuri, Tsuken, Tarama, 
Ikema, Irabu, 85.6.2). Mandarin dialects have also not been listed separately. There are 
several possible instances of shared grammaticalization such as the development from 
nominalization to question markers in the Japanese archipelago (Ainuic, Japonic) as well 
as the development of content question markers from copulas in several Mongolic and 
Turkic languages (see §5.8.2 and §5.11.2). 


Table 6.3: Combination of question markers with other functional domains in 


NEA (2) 
Language Type Form Meaning 
Japanese ?FO  ka#+ wa TOP 
Korean ?FQ ` V-woop + -(n)un TOP 
Wutun ?FO  =mu#/=a# + -ha TOP 
Yuwan FQ V-ui + =du FOC 
Shuri FQ =ji# + =du FOC 
FQ V-ra + =ga FOC 
Ikema FQ =na# + =du FOC 
Irabu PO =ru# + =ru FOC 
CQ =ga# + =ga FOC 
FQ 2x =ru + =ru FOC 
Miyara FQ - + lack of IND -N + =du FOC 
Khalkha FQ =(y)UU# + intonation FOC 
Mandarin FQ ma# + shi FOC 
AQ A ne B + háishi OR 
Urumqi Hui Mandarin AQ A nitt B + seis OR 
Hezhou Chinese AQ X ni’, Y, [haishi] Z etc. OR 
Xunke Oroqen AQ 2x =jA + aaki OR 
Kilen AQ =a# + xoci OR 
Kazakh AQ 2x =MA, 2sc V-MI + ælde OR 
Uyghur AQ 2x =mu, NPST V-m + yaki OR 
Sarig Yughur AQ 2x -Mi, =mu# ~ V-m + tahqi OR 


Of course, the exact mechanisms and processes involved in these instances of gram- 
maticalization need additional investigation. Table 6.3 excludes negation and interroga- 
tives. As can be seen, question marking most commonly combines with focus marking 
and disjunctions. Three patterns of fusion have been listed in Table 6.4. 
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Table 6.4: Fusion of question markers with other functional domains in NEA 


(3) 
Language Type Form Meaning 
Ainu PQ an COP 
Shuri NPQ ?a-ran-i COP 
Shirongolic (Mongolic) PO Bonan wu(u), (m)bu, Kangjia COP 
vu, mbu, Santa wu, 
Mangghuer beinu 
Shirongolic (Mongolic) PO V-(TAM.)Q TENSE 
FQ Foc-l(a)/-lo FOC 
Amur Nivkh 
East Sakhalin Nivkh FQ roc-l(a)/-lu FOC 
Atkan Aleut FQ FOC-ii FOC 
Russian FQ #Foc=li FOC 
Even, (Khamnigan) Evenki, FQ FOC=KU FOC 
Udihe 
Udihe FQ Foc=nA FOC 


As can be seen from the entries with question markers in Table 6.5, a large number of 
descriptions fails to mention the criteria for distinguishing between different question 
markers. At least for Manchu it could be shown that it depends in part on clause type 
(see §5.7.2). There is a wide variety of different criteria, but many, like question marking 
itself, are verbal categories (e.g., TAME, agreement, polarity, clause type). 


6.1.4 Borrowing 


Table 6.6 gives a list of borrowed question markers in NEA. Some cases are not abso- 
lutely clear. See §3.1 for the methodology of establishing whether a question marker has 
actually been borrowed. Most cases require an additional evaluation and elaboration by 
experts of the individual languages. 

One of the most widespread markers is Mandarin ba IE. Probably due to its special 
semantics (§5.9.2.1), it is far more likely to be borrowed than a more neutral question 
marker such as Russian (an (mostly used in the written language), and in fact it can 


be found in many languages of China, from Xinjiang to Manchuria. Some very unclear 
cases were excluded. 

In a few cases it is more likely that similarities are due to chance. Gothic, for example, 
has a second position question marker =u as well as a sentence initial question marker 
an. At a first glance, these are surprisingly similar to Ket second position =u and sentence 
initial an ‘what’, but the large distance in both time and space makes a comparison more 


than doubtful. 
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Table 6.5: Split types of polar and content question marking found in NEA (4) 


Language Type Form Criterion 
CSY PQ, CQ . V-MOOD-AGR.Q person, number 
NSY PQ, CQ . V-MOOD-AGR.Q person, number 
Sirenikski PQ, CQ . V-MOOD-AGR.Q person, number 
Kazakh PO =MA#, 2sc V-MI person, number 
Hatoma CQ PST -, ATTR + =wa#, =ja# clause type 
(non-verbal) 
Sonai CO =ga, =ja(a) (non-verbal) + =ba clause type 
SEL, + =du FOC 
Gyeongsang PQ,CQ PQ -na, cop -ka, CQ -no, cop -ko clause type 
Jeju PO. CO  V-mMoop politeness 
Korean PQ, CO  V-woop politeness 
Hezhou Chinese PQ mai, la?, ni?mu? politeness, polarity 
CQ za’, ni?, ni?za? politeness, semantic category, 
gender of speaker 
Japanese PO -kana(a) vs. -kashira etc. 
gender of speaker 
Kolyma Yukaghir CO FOC.CASE, S/OAGR.Q grammatical relations 
Tundra Yukaghir CO FOC.CASE, S/OAGR.Q grammatical relations 
Ogami PQ =ka#, ?=tu, PST, COP, STAT Ver tense, clause type, aktionsart 
Wutun PQ PFV, RES =mu#, IPFV, PROGR =a# tense, aspect 
Nganasan PQ,CQ PRs V-nu/-na, pst V-hu/-ha, FUT tense, aspect 
V-suto, IT V-koo, RENARR V-ha 
Uyghur PO =mu#, NPST V-m tense 
Forest Enets PQ PST V-sa tense 
Negidal PO -Koof + FUT V-MOOD+AGR.Q, =i# tense, person, ? 
Shuri PQ IND V-mi, NEG V-i, PST V-ti, polarity, tense, ? 
=naa# 
Written Manchu PQ,CQ cop =o, =ni, =nio, -nA, =nu clause type, ? 
Masana Okinoerabu PQ IND -), -Ø + =nja# + PST -ti tense, ? 
instead of -ta, =na# 
Tundra Nenets PQ PST V-sa, DUB -w^h tense, ? 
Amdo Tibetan PQ 2-V, =na#, =ni# ? 
Oirat PQ =(y)UU# ~ =ii# ? 
Bonan PQ V-(TAM.)Q, V-si ? 
Hezhou PQ mat, la#, (ni)mu# ? 
Xunke Oroqen PO =jA#, =o0# ? 
Udihe PQ =Ku#, =nA# ? 
Kilen PQ =no#, =a#, -mas ? 
Salar PQ =mU#, =U# ? 
Sarig Yughur PQ =Mi#, =mu# ~ V-m ? 
A Nivkh PQ V=l(a)/=lo ? 
ES Nivkh PQ V=\(a)/=lu ? 
A Nivkh CO -ra, =at(a) ? 
ES Nivkh CO -ra, sf, =ara ? 
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Table 6.6: Possible instances of borrowing and loan translations of question 
markers in NEA. When several dialects have a given form, only one was men- 


tioned 


Type 
Yakut, Dolgan =duo, =duu PQ, AQ ?— Kolyma Yukaghir =duu 
Mandarin ba HE PO —  Mangghuer ba, Bonan ba, Kangjia 
ba, Santa ba, Kilen ba, Sanjiazi 
Manchu ba, Sibe ba, Salar ba, 
Khorchin ba(a), ?Oroqen baa ~ 
baa, Shineken Buryat baa, Solon 
baa, Dagur baa etc. 
Mandarin ma Di PQ — Bonan ma, Kilen ma, Yibuqi 
Manchu mA 
Mandarin háishi i AQ — Yibuqi Manchu xegi, Kilen, 
Oroqen hafi 
Russian ili AQ — Ket ili, Chulym ili, ?Kamass aali, ? 
Anadyr Yukaghir ali 
Nivkh -ya, =ii CQ, PQ —  Uilta -KA, ?=(y)i 
Khamnigan Mongol bei CQ —  Khamnigan Evenki bei 
Russian li/z: PO —  Khamnigan Mongol =li (AQ) 
Koreanic, e.g. Middle Korean -ni, all — Jurchenic, e.g. Manchu -ni, -nio, 
-nyo, -(k)o, ?-nja =o, ?-nA etc. 
Mongolic ‘cop>Q’ (*biti) CQ — Salar -i, Tuvan -Il, Dukhan -II, 
Yakut -(n)1j, Dolgan -ij 
PQ, AQ — Salar -u ~ -o, Oroqen =99, Ongkor 
Mongolian -(y)UU - y.ii Solon =uu - zii, ?Even =i, ? 
Negidal =i, ?Uilta =(y)i 
Dagur =yee PQ, AQ => Oroqen -jz, ?Sibe =ja etc. 
Udihe =nu, (?=nA) PQ, FQ, AQ — Kilen =nə 
PQ ?—> | Oroqen -jOOmAA 
Mongolian youm-aa 
Khorchin jimee CQ ?— Solon yeeme 
Buryat -gü PQ, AQ ?— Solon =gi 
Ewenic (Tungusic) “=Ku PQ, AQ ?— Buryat -gü, Khamnigan Mongol 
Uyghur =mu PQ — Salar, Sarig Yughur, Hezhou, 
Tangwang, Wutun =mu 
Uzbek =mi PQ — Tajik =mi 
Yukaghir AGR.Q CQ ?—.  Negidal (PQ, CQ, AQ) AGR.Q 
Ukrainian cy etc. PQ, AQ — Yiddish ci 
Burushaski or Dardic -a PO ?— Sarikoli =o 
Kamass -a PO ?7— | Kottà 
(Turkic ?—) Kamass -bV AQ ?— Kott =bo 
Hezhou la? Hi PQ — Santa -la 
Old Japanese ya PO ?— Ain va 
Xining Mandarin le?? We CQ —  Kangjia le 


404 


6.2 Interrogatives 


6.2 Interrogatives 


6.2.1 Formal properties 


This study has emphasized on formal properties of interrogatives such as the overall 
shape and the initial sounds. Several language families in NEA exhibit a KIN-interroga- 
tive: the interrogative meaning ‘who’ in a given language has the form KIN (velar or 
uvular plosive or fricative, (high) vowel (short or long), (apical) nasal), followed by an 
optional final vowel, e.g. Turkish kim, Forest Nenets kim’a, Aleut kiin etc. 30 (ca. 36%) 
out of the 83 languages exhibit KIN-interrogatives (Figure 6.14). The phenomenon can 
be traced back over considerable time-spans to several of the proto-languages of NEA 
(Table 6.7). In some instances (indicated with a question mark), the similarity most likely 
is due to pure chance. Itelmen k’e, for example, superficially resembles the other forms, 
but most likely derives from PCK *mikae (Fortescue 2005). 


Table 6.7: KIN-interrogatives in NEA 


Language Form Source 

? Ainuic *gu(n)na Vovin 1993 

Atkan Aleut kiin Bergsland 1997 
Eskimo (ABs.sc) *ki-na Fortescue et al. 2010 
?Indo-European (NOM.SG) *kYí-s ~ "k"ó-s Mallory & Adams 2006 
?Itelmen ke < ?*mikæ Fortescue 2005 
Mongolic *ke-n Janhunen 2003d 
Samoyedic *kim(3) ~ *kima Janhunen 1977 
Turkic *kim, Oghur *käm Schönig 1999 

Uralic “ki ~ "ke Nikolaeva 2006 
Yukaghiric *kin Nikolaeva 2006 


In some of the remaining instances the similarity could in fact indicate long distance 
relationships. However, these limited data cannot, of course, proof any valid genetic 
unity as was assumed by Greenberg (2000: 217-224). Nevertheless, their similarity as 
well as the fact that interrogatives meaning ‘who’ appear to be especially conservative, 
may suggest certain directions that deserve further investigation. The KIN-interrogative 
seems especially promising due to its wide distribution. Related phenomena are often 
restricted to only a few language families, such as Turkic *qay- (e.g., Uyghur qay-, Khakas 
xay-) and Tungusic *Kai ‘what, which’ (e.g., Alchuka kai-, Nanai xar). Such cases can 
often be more readily explained by language contact or chance. 

A phenomenon similar to the well-known m-T-pronouns (e.g., Italian mi, ti, Nichols 
& Peterson 2013) are so-called K-interrogatives: more than two interrogatives in a given 
language start with a velar or uvular plosive or fricative (Figure 6.15), e.g. Nanai xar 
‘what’, xado ‘how many’, xooni ‘how’, Uyghur qaysi ‘which’, qacan ‘when’, qandaq 
‘how’ etc. The consonant must be identical in the different forms. Altogether 39 (ca. 47%) 
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out of 83 languages exhibit K-interrogatives, which speaks in favor of a very strong 
areal feature. While more research is necessary to establish their full geographical ex- 
tent around the globe, at least parts of Eurasia share the phenomenon, e.g. Italian chi 
‘who’, che ‘what’, quale ‘which’, all of which start with [k] (my knowledge), or Bengali 
ke ‘who’, ki ‘what’, kon ‘which’ (Thompson 2012: 202) (see §5.5.3.1). Of course, there are 
also languages outside of Eurasia with K-interrogatives, but a comprehensive treatment 
requires a large cross-linguistic sample. Greenberg’s (2000: 217-224) investigation of the 
alleged “Eurasiatic” interrogative starting with k- overlaps with my notions of KIN- and 
K-interrogatives but is fundamentally different. My categories are first and foremost ty- 
pological in nature and K-interrogatives are only accepted for a given language if at least 
three interrogatives share the same initial consonant. Middle Korean, for example, which 
apparently has only one interrogative starting with h- does not fulfill this criterion and 
therefore has no K-interrogatives. Greenberg (2000), on the other hand, merely assumes 
that the individual forms are all related to each other but does not follow any accepted 
methodology. Greenberg (2000) furthermore does not clearly differentiate between inter- 
rogatives with different meanings but treats them as one category. However, as shown 
in Figure 6.16, personal interrogatives in 43 (52%) out of 83 languages do not share the 
same initial consonant and thus should be kept separate. For instance, in Yukaghiric and 
several Turkic languages the personal interrogative starts with k-, but more peripheral 
interrogatives start with q- instead. This phenomenon is not restricted to NEA, but can 
also be found in other languages, such as the Dravidian language Kurux (Kobayashi & 
Tirkey 2017: 91). In this language, all interrogatives except ne: ‘who’ begin with an e~. This 
result indicates that personal interrogatives (and the category PERSON in general), have 
a very special position and most likely are more stable than most other interrogatives. 
Tungusic, for example, has the interrogatives *niii ‘who’, *ja- ‘what’, and a larger group 
with a resonance *K~. Given that the second and at least some of those interrogatives 
starting with *K~ have most likely a Mongolic origin, the interrogative “niii could repre- 
sent an older layer of the interrogative system. In general, the distribution of KIN- and 
K-interrogatives overlaps with m-T-pronouns and front rounded vowels, which could in- 
dicate an old dispersal of languages in Eurasia that may have had its origin in southern 
NEA (Nichols 2010). 


6.2.2 Semantic scope 


Unfortunately, not much can be said about the semantic scope of interrogatives in NEA. 
Most descriptions are extremely vague about the exact meaning of interrogatives, which 
is why no absolute numbers can be given here, but relatively clear cases of polysemous 
forms have been collected in Table 6.8. Some polysemies are very frequent (e.g., QUAN- 
TITY MASS = COUNT, MANNER = REASON), while others are extremely rare. PERSON = THING 
can only be found in Tocharian B Kee, máksu, and perhaps Ainu ne- or Mongolic "ke- 
The semantic scope of interrogatives gives a specific pattern for every language. Fig- 
ure 6.2 illustrates this with the help of several Mandarin interrogatives. The largest cate- 
gory encompassing THING, ACTIVITY, REASON, and TIME is formed by shénme ‘what’ and 
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its derivations. In Mandarin dialects there are some deviations from this pattern. For 
example, nd- ‘which’, in the form nd-yi-ge ‘which-one-cLF’, has expanded its scope to 
include the category of PERSON as well (§5.9.2.1). 


Table 6.8: Polysemous interrogatives in NEA 


Scope Example 


ACTIVITY = REASON Mandarin gan shénme ~ ganma 


KIND = MANNER . 
Mongolian yamer 


MANNER - REASON Mandarin zénme 

SELECTION = MANNER Tuvan kandig 

SELECTION = PERSON Mandarin dialects na-(yi)-ge 
THING = REASON Sogdian (a)éu 

THING = SELECTION Manchu ya 

PLACE = DIRECTION English where 

PERSON = THING Tocharian B kyse, maksu 
QUANTITY MASS = COUNT Kolyma Yukaghir qamun 


As in this example, innovative interrogative systems are usually based on the cat- 
egories of THING and SELECTION (e.g., Cysouw 2007). In principle, the pattern can be 
given for every language for which sufficient information is available. For reasons of 
space, however, this cannot be accomplished here for all the languages of NEA. How- 
ever, as shown in Chapter 5, the conceptual space clearly is able to capture the semantic 
scope and diachrony of most interrogatives in NEA. 


wei shénme 


gan shénme 


ACTIVITY REASON ` ZNE 
4 : shénme shihou 
. zénme yang 
` zénme 
shénme THING 
PERSON MANNER TIME 
shéi AL 

SELECTION QUANTITY 

inm duosháo, ji + CLF 
ná-li PLACE 
nd- + CLF 


Figure 6.2: Semantic scope of several Mandarin interrogatives 


The semantic scope of locative interrogatives can be shown with an additional con- 
ceptual space. Figure 6.3 illustrates this with the help of Mandarin data. In Mandarin, all 
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three categories are marked with ná-li or its variants, which is, depending on the con- 
struction, combined with verbs or prepositions that derive from verbs. However, there 
are other systems with either identical forms for more than one category and systems 
with synchronically opaque formations. Some languages for which sufficient informa- 
tion was available are compared in Table 6.9. The individual forms may either be related 
to each other or not (e.g., Ukrainian). There are several interrogatives that have a scope 
covering two of the categories (e.g., Japanese, English, Manchu, Mandarin). LOCATION 
appears to have a tendency to be the unmarked member of the group and often serves 
as a basis for derivations (e.g., Buryat, English, German, Japanese, Mandarin). 


(zài) ná-li 
LOCATION 


DIRECTION SOURCE 
(wang) na-li cong ná-li 


Figure 6.3: Semantic scope of simplified Mandarin locative interrogatives 


Many languages in NEA distinguish the three different categories by means of case 
marking or adpositions (e.g., Mandarin, Khakas, Evenki, Nenets, Kolyemal, Amdo Ti- 
betan, Central Siberian Yupik). However, in some instances not all three forms are based 
on the same stem (e.g., Buryat, Kolyma Yukaghir). 

Information from grammar books is usually insufficient to decide about the semantic 
scope of interrogatives expressing QUANTITY (MASS—COUNT). Nevertheless, some clear 
examples can be given in order to illustrate possible patterns (Table 6.10). 

Some languages have only one (e.g., Kolyma Yukaghir), others have two different 
forms (e.g., English, Mandarin, Mongolian). If there are two different forms, these may 
either be related etymologically (e.g., English, Mongolian) or can have a completely dif- 
ferent origin (e.g., Mandarin). In some cases COUNT is derived from Mass (e.g., German), 
which appears to be the unmarked category. In some other cases the semantic scopes of 
individual forms overlap (e.g., Mandarin). The use of any of the forms is usually based on 
subtle differences and the boundary between mass and count nouns is language-specific. 
In principle, the distribution of different types, such as MASS=COUNT vs. MASS#COUNT, or 
SELECTION=PLACE VS. SELECTION#PLACE could be shown on geographical maps, but the 
information for most languages was simply insufficient. 
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Table 6.9: Some examples for the semantic scope in the category PLACE. Only 


a selection of forms and languages is listed 


Language LOCATION DIRECTION SOURCE 
Amdo Tibetan kar-na kar-a kar-ni 
Buryat xaa-(na) xai-sha xaana-haa 
Chukchi mir-ke mir-kori mer-qo(ro) 
CS Yupik na-ni na-vek na-ken 
English where where (to) where from 
Evenki ii-du ii-le ii-duk 
German wo wo-hin wo-her 
Japanese doko (ni) doko e/ni doko kara 
Ket bisén biltan bíll 
Khakas xay-da xay-ya xay-dar 
Kolyemal odi-mee odi-f odi-so 
Kolyma Yukaghir qo-n qa-ride qo-t 
Manchu ai-ba-de ai-ba-de ai-ba-ci 
Mandarin (zài) ná-li (wáng) ná-li cóng ná-li 
Tundra Nenets xa-n’a-na xo-n'a-h xo-n'a-d* 
Ukrainian de kudy zvidky 


Table 6.10: Some examples for the semantic scope in the category QUANTITY 


Language MASS COUNT 

English how much how many 

German wie viel wie viel-e etc. 

Kolyma Yukaghir qamun qamun 

Mandarin duosháo duosháo, ji- + CLF 
xedii xedn 

Mongolian 
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6.2.3 Diachrony of interrogatives 


§4.3 identified seven possible diachronic developments of interrogatives. Of these, the 
convergence of forms is apparently only attested in the northern Tungusic languages 
Orogen and Khamnigan Evenki, where the two Proto-Tungusic interrogatives *ja ‘which, 
what’ *Kai ‘what’ coalesced in a form i(i)- (§5.10.3). The replacement of interrogatives as 
in Italian che > che cosa > cosa ‘what’ or the development of interrogatives from skratch 
do not appear to be very widespread. However, some examples can perhaps be found 
in Tocharian, e.g. PIE *k"i- ‘INT + *so ‘DEM’ > Proto-Tocharian *k”asé > kyse > se ‘who’ 
(§5.5.3.5). The remaining four developments (repeated here in Table 6.11) are more fre- 
quent. 


Table 6.11: The most important diachronic developments of interrogatives 


Schematic Details Example 

1 INT, > INT; phonological PIE *kwód > OE hweet > NE what 
changes 

2 INT; > INT? semantic changes Wutun age ‘which (one) > who’ 

3 INT -Xcram > INT? English where to 
inflection (> fusion) 

4 INT -Xigx > INT; derivation, English how much 
reinforcement (> 
fusion) 


(1) Most languages have a large number of inherited interrogatives. An exception is 
Mandarin, which apparently preserves only the two Old Chinese interrogatives shéi 
(shui) 3& and jï- JL (§5.9.3.1). For details of individual language families, the reader is re- 
ferred to Chapter 5. The loss of the resonance due to phonological changes, which may 
lead to a very different interrogative systems, is only attested for Tungusic languages 
(§5.10.3). 

(2) Semantic changes appear to be quite infrequent, but they are often difficult to de- 
tect because of the lack of data. Some relatively clear examples have been collected in 
Table 6.12, which includes only those cases that do not also involve inflection or deriva- 
tion. For example, Mandarin gan shénme can mean both ‘to do what’ and ‘why’, without 
requiring any additional marking. 

There are too many instances of inflection (3) or derivation (4), which is why Table 6.13 
lists only general patterns illustrated with some examples. Over time many derived in- 
terrogatives fuse, are subject to phonetic erosion, and become unanalyzable (e.g., MHG 
war + umbe ‘where + around’ > German warum ‘why’). 
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Table 6.12: Changes in the semantic scope of interrogatives with some exam- 


ples 
Source Target Examples 
ACTIVITY | —  REASON Mandarin gan shénme ‘to do what, why’ 
MANNER —  REASON Mandarin zénme ‘how, why’ 
PLACE —  MANNER Manchu absi ‘whither, how’ 
—  REASON German woher ‘whence, why’ 
SELECTION — PERSON Solon awu 
THING —  REASON ?Chitose Ainu hemanta ‘what, why’ 
Table 6.13: Some possibilities of inflection and derivation of interrogatives 
Type Examples 
INT + ADJ Japanese do-no ‘which’ etc. 
INT + ADP English where to, Mandarin wang náli ‘id’ etc. 
INT + CASE Evenki ii-le ‘whither’ etc. 
INT + CLF Mandarin ná(-yi)-ge, ji-ge etc. 
INT + CVB Manchu ai.na-me ‘why’, Khorchin joo ga-j, Mongolian yaa-j 
‘how’, yaa-gaad ‘why’ etc. 
INT + DEM Tocharian "kä sé, Slavic *ksto, Lingshi Mandarin ug^ su 


INT + gender 


INT +N 
INT + number 


INT + numeral 
INT + person/number 


INT +Q 


INT + V 


‘ , 


who 
Ket bisen-du ‘where-3sc.m’ etc., German welch- ‘which’, 
Ukrainian kotoryj ‘which’ etc. 

English what kind, Mandarin zénme yang(de) ‘id’ etc. 

Aleut kiin ‘who (sol, kiin-kux ‘who Dou), Eastern kiin-kun, 
Atkan kiin-kus ‘who (PLY etc. 

English which one, Written Tibetan ci.cig, Mandarin na-yi- 
Ket biser-di ‘where-1sc’ etc., Nivkh ja(y).gu- ‘how’, German 
welch- ‘which’, Ukrainian kotoryj ‘which’ etc. 

Korean nwu-ku ‘who’, Tocharian B tāśśi < ta + aśśi ‘where’, 
Amur Nivkh an ?< nar-ga ‘who’ 

English to do what, Manchu ai-na- ‘id’ etc. 
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6.2.4 Borrowing 


Table 6.14 gives a list of possibly borrowed interrogatives in NEA. Several instances that 
are marked with a question mark remain somewhat unclear. Turkic “ne ‘what’, the only 
autochthonous word starting with an n-, is problematic because neither genetic inher- 
itance, nor borrowing appear to be plausible explanations for this anomaly, which de- 
serves further research. 


6.3 The significance of the grammar of questions 


What has been called the grammar of questions in this study is of great significance from 
a number of different perspectives. 


Questions are of interest not merely as interrogative sentences or techniques. They 
are instances of stimuli to which people respond and thus represent a matter of 
broad intellectual interest beyond grammatical and functional concerns. Questions 
entail cognitive and expressive processes, social relationships, and interactional 
discourse. They are also the device by which several enterprises of societal and 
individual significance characteristically proceed. Apart from any relation to re- 
sponse, questions alone are of further interest for their function in the thinking 
of those who ask them—for their motivation of children's thought and scholars’ 
inquiry. (Dillon 1982: 162) 


For example, as seen in $4.4, the internal structure of the grammar of questions allows 
some conclusions about the underlying cognitive structure. The frequent combination of 
content questions with polar, focus, or alternative questions allows an inference on the 
underlying cognitive process that seems to proceed from the schematic to the specific. 


(1) Abui (Timor-Alor-Pantar) 
moku kiang nu he-n-u nala, moku neng re mayol? 
kid baby this 3O.roc-be.like.this-PFv what kid man or woman 


"What is the baby, a boy or a girl?’ (Kratochvil 2007: 175) 


As shown in $4.4, this pattern can be found in languages around the world. This obser- 
vation of a recurrent pattern in languages that are unrelated and lack mutual influence 
suggests a general tendency. In fact, the pattern is in accordance with a hypothesis pro- 
posed by Bar (2009: 1235) 


that the human brain is proactive in that it continuously generates predictions 
that anticipate the relevant future. In this proposal, analogies are derived from 
elementary information that is extracted rapidly from the input, to link that input 
with the representations that exist in memory. Finding an analogical link results in 
the generation of focused predictions via associative activation of representations 
that are relevant to this analogy, in the given context. 
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Table 6.14: Possible instances of borrowing of interrogatives in NEA 


Source 


Target 


Mongolic *kedii- “how many’, *keli 
‘when’, *yaxun - “ya- ‘what’ 
Dagur joonde ‘why’ 

Mongolic *ya- via Tungusic *ja- ‘(to 
do) what’ 

Amur Nivkh rag ‘where’, East 
Sakhalin Nivkh nunt/nud ‘what’ 
Turkic, e.g. Tuvan kay(i) ‘which’, 
kazan ‘when’ 

Solon awu ‘who’ 

Udegheic, e.g. Udihe ni ‘who’, adi, 
‘how’, je’-(u), je-me ‘what kind’, 
Oroch oni ‘how’, Udihe ono-bui 
‘which’ 

Ewenic, e.g. Evenki ni ‘who’, ady 
‘how much’, i-du ‘where’, ee-da 
‘why’, Orogen jEEma ‘which’, aali 
‘when’ 

Ewenic, e.g. Oroqen oki ‘how much’ 
Jurchenic, e.g. Manchu ya ‘which’, 
atanggi ‘when’ 

Hezhou Chinese azi?^go ~ aji?*go 
‘which one, who’ 

Mandarin jídián ‘what o’clock’ 
Sinitic, e.g. Mandarin ma ‘what’ 
Mandarin shénme ‘what’ 
Nganasan sili ‘who’ 

Russian kokda ‘when’, kakoj, kotoryj 
‘which’, kuda ‘where’ 

Russian kakoj ‘which’ 

Russian kak ‘how’ 

Iranian, e.g., Sogdian (a)éu ‘what’ 


Koreanic *e- 
Turkic *qay-, e.g., Uyghur qay- 
Khakas xay- ‘what, which’ 


?7— 


l 


Tungusic *Kadü, *Kaali, “ja(-kun) 


Nanmu Oroqen joonde 
Nivkh ja- 


Uilta saa ‘where’, sado ‘where’, 
nuulu ‘whither’ 
Selkup qaj, ?quzan 


Nanmu Orogen awu 
Kilen ni, adi, ja, yao, iama-, oni, ? 
oni-bici 


Kili rii, adii, i-du, ii-daj, e-ma, ali 


Kilen uki 
Kilen iatin 


Mangghuer ayige 


Santa dzidzian(-da) 

Santa, Kangjia ma- 

Chinese Pidgin Russian Syma 
Taimyr Pidgin Russian syly 
Copper Island Aleut ka(g)da, kakuy, 
katoroye, kuda 

Chalkan qaqoy - kakoy 

Selkup kak - kar 

Siberian Turkic, e.g. Chalkan t'u(u) 
etc. 

Japonic "e- 

Tungusic *Kai, e.g., Alchuka kai-, 
Nanai xar 
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Consider the following example from a language spoken in Eastern Sulawesi. 


(2) Balantak (Celebic, Austronesian) 
ime a men mae’, yaku’ kabai i koo? 
who ART REL go  1sG or PERS.ART 2SG 


‘Who will go, you or I?’ (van den Berg & Busenitz 2012: 66) 


The context of the utterance is difficult to reconstruct. But the content question contains 
an interrogative that represents an initial categorization of a given referent (in this case 
PERSON). The following alternative question represents possible predictions concerning 
the identity of that referent. The choice of the interrogative thus also offers direct evi- 
dence for the most basic categorization and organization of our knowledge. 

The Introduction has claimed that the grammar of questions can function as yardstick 
for measuring the intensity of the intensity of language contact, areal convergence, un- 
usually strong language contact, and simplification. The remainder of this section briefly 
evaluates these claims and argues that in many cases they give valuable and good results. 
On the identification of long-range relationships see §6.2.1. 

Regarding the Amdo Sprachbund, for example, Slater (2003a: 6) observed the follow- 
ing: 


It certainly is true that intense two-language contact situations have resulted in 
many instances of localized contact-induced language change, and I do not mean to 
suggest that two-language comparisons should not be made in the Qinghai-Gansu 
region. However, what has often been lacking is an overview of the regional pro- 
cesses of linguistic feature diffusion. 


In fact, as seen in §3.5, many features mentioned by Janhunen (2012c: 180ff.) such as SOV 
word order fail to define the region as a linguistic area because they are too frequent 
worldwide and in adjacent regions. However, the investigation of the grammar of ques- 
tions has potentially revealed two features that could help define the Amdo Sprachbund. 
Sandman (2012: 384), by comparing two languages, came to the following reasonable 
conclusion. 


In Bonan, the most common interrogative marker is -u. The interrogative marker 
-mu is formed by attaching the interrogative marker -u to the narrative aspect 
marker -m. The narrative aspect marker indicates stative or habitual aspect in Bo- 
nan. The borrowing of the interrogative marker -mu is another example of gram- 
matical borrowing from Bonan to Wutun. 


By just looking at these two languages the conclusion is, of course, very plausible 
because the marker has a clear etymology in Bonan but not in Wutun. However, there 
is another possibility that treats the Wutun question marker as a loan from Turkic, e.g. 
Uyghur =mu, Sarig Yughur =mu, or Salar =mU, perhaps via Hezhou Chinese =mu or 
Tangwang =mu. If this scenario is accurate, the markers in Wutun and Bonan are only 


414 


6.3 The significance of the grammar of questions 


similar by chance. In fact, the question marker -mu in Bonan has parallels in other Mon- 
golic languages of the area such as Mongghul -muu, Santa -mu, and Kangjia -mu. How- 
ever, even if one excludes the Mongolic question marker, the presence of a relatively 
widespread and specific question marker =mu in Turkic and Sinitic languages of the re- 
gion that is absent in the surrounding area is certainly a better defining feature than SOV 
word order. Another example is the presence of single marking on the first alternative 
in alternative questions (Figure 6.10) shared at least by Gangou (not shown on the map), 
Hezhou, Wutun, Santa, Bonan, Kangjia, and Mangghuer. This feature, again, can also be 
found in Uyghur as well as Urumqi Hui Chinese, but not in the surrounding languages 
in NEA. Given the lack of information on alternative questions, this feature might well 
be more widespread in the area. In fact, there is some indication that it can perhaps also 
be found in the immediate south of the Amdo Sprachbund (see 84.2.1). 

The Amdo area, of course, is known to be a region of strong linguistic convergence 
and even creolization, but question marking can also identify contact situations that 
are otherwise hard to detect. It is well-known, for example, that there was contact be- 
tween Koreanic and the Tungusic language Manchu. However, previous studies have 
been quite unsuccessful in identifying any conclusive linguistic evidence for this his- 
torical fact. Vovin (2013a: 224f.) has collected a short but extremely valuable list of 17 
Koreanic items in Manchu, some of which unfortunately are somewhat problematic. In 
my opinion, the Manchu third person pronoun i, for example, more likely derives from 
Mongolic *i Janhunen 2003d: 18), because it shares an identical oblique stem formation, 
e.g. Manchu in-i ‘3sG.0BL-GEN’, Proto-Mongolic *in-U > *in-i “3sG.0BL-GEN’. Pronouns 
are not easily borrowed and perhaps only the contact with Mongolic was strong enough 
(e.g., Doerfer 1985). At least some of his correspondences such as Manchu fucihi "Bud. 
dha’ (from Middle Korean pwüthyé) are very plausible. In fact, the form p‘ut(‘)ihi.n in the 
language Bala makes this even more likely (Mu Yejun 1987), but cultural loanwords such 
as this are not necessarily a sign of direct language contact. This study has identified a 
whole list of question markers in Manchu (and Jurchenic) that appear to systematically 
derive from a Koreanic source. Not only are the Manchu question markers very different 
in form and semantic scope from the rest of Tungusic, but the forms are strikingly simi- 
lar to Koreanic (see 85.7.2, §5.10.2). Such markers can only have been borrowed through 
direct interaction of the speakers of these languages. 

Manchu is perhaps the most aberrant Tungusic language and I have previously put 
forward the possibility that it might even be comparable to languages such as Afrikaans 
(Hólzl 2012; 2015a: 151). In fact, not only the question marking system, but also the in- 
terrogative system is rather different from other Tungusic languages ($5.10.3). While 
Manchu preserves some Tungusic interrogatives, there is a large amount of innovative 
forms that are based on the two stems ai ‘what’ and ya ‘which’, which speaks in favor of 
a certain amount of simplification (Table 6.15) due to massive non-native acquisition in 
the history of Manchu (McWhorter 2007; Operstein 2015). Of course, this theory should 
actually include all of Jurchenic. 

Another striking example is Mandarin, which is also known to have experienced a 
certain amount of simplification with respect to Old or Middle Chinese and other Sinitic 
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languages (McWhorter 2007: 104-137) and contains a large amount of analyzable interrog- 
atives as well (§5.9.3.1). Notice that this is qualitatively different from the contact between 
Manchu and Koreanic, which lead to complexification instead (Table 6.15). Manchu ac- 
tually exhibits more question markers than other Tungusic languages and this must be 
due to influence from Koreanic. Unlike other Tungusic languages, but similar to Korean, 
Manchu also employs the question markers in content questions, which from a certain 
perspective could be interpreted as a type of redundancy. This must be the result of a dif- 
ferent language contact scenario that involves longstanding contact and perhaps some 
bi- or multilingualism. See Hólzl (2017c) for an additional discussion of simplification 
and complexification of Tungusic interrogative systems. 


Table 6.15: Complexification and simplification (Trudgill 2011: 62) 


Complexification Simplification 
irregularization regularization of irregularities 
increase in opacity increase in morphological transparency 


increase in syntagmatic redundancy 
reduction in syntagmatic redundancy 


addition of morphosyntactic categories ` loss of morphological categories 


Tungusic also offers a good example for yet another type of language contact that 
leads to the mixing of languages. The language Kilen, for example, is well-known to be 
a mixed Tungusic language and has been sometimes classified with Nanai (e.g., Alonso 
de la Fuente 2011; Janhunen 2012d; this study) and sometimes with Udihe (Kazama 2003). 
Influence from Manchu has often been overlooked, however (see Holz) 20172). In fact, 
Kilen exhibits interrogatives that can clearly be shown to derive from Nanai, Udihe, and 
Manchu, which represent three different branches of the language family. Consider the 
following example. 


(3) Kilen (Tungusic) 
ni jarin ja-tula ana-kicia? 
who when where-ALL go-?suBJ 


"Who would like to go to where when?’ (Zhang 2013: 163) 


The verb most likely derives from Nanaic, but every interrogative must stem from other 
branches of the language family (Table 6.16). Such a mixed language can only be the re- 
sult of multilingualism throughout the entire speech community: “Unlike creoles, mixed 
languages arise in bilingual settings in which the speakers are equally fluent in the two 
codes.” (Operstein 2015: 6) 

Another example seen in NEA is Copper Island Aleut, which exhibits Russian and 
Aleut interrogatives (§5.4.3). Creoles are both mixed and exhibit “extreme simplification 
on all levels” (McWhorter 2007: 254) due to non-native acquisition. In principle, they 
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Table 6.16: The etymological brackground of the Kilen elements in (3) 


Language who when whither ALL to go 

Manchu (Jurchenic) we ya erin ya-de -de gene- 

Nanai (Nanaic) ui xaali xaosi -dola ənə- 

Udihe (Udegheic) ni ali je-uxi -dulA nene- 

Evenki (Ewenic) ri ookin i-le -dulA/- pene- 
tulA 


should exhibit both a simplified or transparent interrogative system as well as interroga- 
tives from different sources and this indeed seems to be the case for at least some of them 
(Bickerton 2016 [1981]: 65f.; Muysken & Smith 1990). There are no true creole languages 
in NEA, but Taimyr Pidgin and Chinese Pidgin Russian had interrogatives of Russian 
and dialectal Russian origin as well as at least one from Nganasan and Chinese, respec- 
tively (§5.5.3.3). Taimyr Pidgin furthermore had at least some innovative interrogatives 
such as kudy-mera ‘where’, kudy-mesto ‘where’, and kakoj storona ‘whither’. 

Another type of change we see under creolization is the translation of individual forms 
such as Chinese Pidgin Russian mnogo-malo, which consists of Russian mnógo/MHoro 
‘much’, mélo/mano ‘little’ and is a direct translation of Chinese duosháo ^^ ‘how much’. 
Calques are not necessarily restricted to creole and pidgin languages, however, but can 
also be found in instances of bilingual contact. Most cases found in NEA are partial bor- 
rowings and contain an autochthonous interrogative, e.g. Qiang na-tian from Chinese ji 
didn JL, ‘what hour’ (LaPolla & Huang Chenglong 2003: 53f.). A mixture of calque and 
borrowing can also be found in Santa yan shihou from Mandarin shénme shíhou ‘what 
time’. Special cases of borrowing are, furthermore, iama-dzaka ‘what thing’ and perhaps 
iama-arin ‘what time’ in the Tungusic language Kilen, which derive from two different 
sources. The actual interrogative has been borrowed from Udihe je-me ‘what kind’, while 
the second elements derive from Manchu jaka ‘thing’ and perhaps erin ‘time’, both of 
which are also present in Manchu interrogatives. In some cases an interrogative has been 
entirely translated. Manchu ai se-me and Khorchin Mongolian joo ga-j, for example, have 
the same underlying pattern ‘what say-cvB.IPFV and both mean ‘why’. Khalkha xer olon 
or Ket bilon appear to have been formed on the basis of a European pattern also seen in 
English how many/much. 

These examples illustrate that the grammar of questions can indeed function as a pre- 
liminary but valuable tool for the identification of different types of language contact, 
but this section has focused on individual instances of diffusion or convergence, exclu- 
sively. The following maps ofthe atlas allow an additional identification of large patterns 
of areal convergence that is impossible from the study of individual languages alone. 
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6.4 An atlas of the grammar of questions in Northeast 
Asia 


The geographical extent of certain features will be demonstrated with the help of a syn- 
chronic sample of 83 languages (Figure 6.4, Table 6.17) that covers all 14 language families 
of NEA. Languages with a wide geographical distribution are underlined in Figure 6.4 
and shown with bigger symbols in Figure 6.5-Figure 6.16 below. The maps exclude ex- 
tinct languages and list only some dialects of a given language. An exception is made for 
Ainuic, which by now is probably completely extinct but has been added for reasons of 
completeness. 

Given somewhat unclear boundaries between languages and dialects in Japonic, di- 
alectal variation in this family may be slightly overrepresented. It may be noted that the 
lack of data for some languages might have led to some distortions. Nevertheless, the 
general areal patterns seem to be valid. The white line in Figure 6.4 indicates the rough 
definition of NEA adopted in this study. The distribution of the languages clearly shows 
a large spread zone over large parts of Northeast Asia, including Northern China, Mon- 
golia, Siberia, Korea, and Japan (excluding Hokkaido and the Ryükyüan Islands) with 
few but widespread languages. Residual zones with many local languages are found in 
northern Manchuria (including Sakhalin and Hokkaido), the Ryükyüan Islands, the Aleut 
Islands, Amdo, the Altai, along the Yenisei, and perhaps on Kamchatka. 
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Family Language Number 
Ainuic Chitose Ainu 1 
Saru Ainu 2 
Shizunai Ainu 3 
Amuric Amur Nivkh 4 
East Sakhalin Nivkh 5 
Chukotko-Kanchatkan Chukchi 6 
Alutor 7 
Koryak 8 
Itelmen 9 
Eskaleut Central Siberian Yupik 10 
Aleut, Atkan 11 
Copper Island Aleut 12 
Indo-European Russian 13 
Ukrainian 14 
Plautdiitsch 15 
Yiddish 16 
Sarikoli 17 
Japonic Japanese 18 
Hachij6 19 
Yilan Creole 20 
Yuwan 21 
Okinoerabu, Masana 22 
Shuri 23 
Ikema 24 
Ogami 25 
Irabu 26 
Hateruma 27 
Hatoma 28 
Miyara 29 
Sonai 30 
Koreanic Korean 31 
Jeju 32 
Mongolic Dagur 33 
Buryat 34 
Khamnigan Mongol 35 
Cyrillic Khalkha 36 
Oirat 37 
Shira Yughur 38 
Santa 39 
Bonan 40 
Kangjia 41 
Huzhu Mongghul 42 
Minhe Mangghuer 43 


419 


6 Interrogative constructions in Northeast Asia: A summary 


Trans-Himalayan 


Tungusic 


Turkic 


Uralic 


Yeniseic 
Yukaghiric 


Standard Mandarin 
Urumqi Hui Mandarin 
Hezhou 
Tangwang 
Wutun 

Amdo Tibetan, Gonghe 
Baima 

Zhongu 

Even 

Evenki 

Negidal 

Kh. Evenki 
Oroqen, Xunke 
Solon, Huihe 
Udihe 

Kilen (An Jun) 
Nanai 

Ulcha 

Uilta 

Sibe 

Salar 

Kazakh 

Kyrgyz 

Tatar (Chinese) 
Uyghur 

Tuvan 

Tofa 

Khakas 

Sarig Yughur 
Altai Turkic 
Chulym, Middle 
Dolgan 

Yakut 

Nganasan 

Forest Enets 
Tundra Nenets 
Selkup, Taz 

Ket 

Kolyma Yukaghir 
Tundra Yukaghir 


44 
45 
46 
47 
48 
49 
50 
51 
52 
53 
54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
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Figure 6.4: Approximate geographical location of the 83 languages in the sam- 


ple (1) 
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Figure 6.5: Polar question marking (2) 


Color Type Languages 
white sentence-final marker 45 
white with black dots mixed 19 
black with white outline suffix(es) 4 
gray with white outline no morphosyntactic marker 6 
white with vertical black lines sentence initial marker 3 
white with horizontal black lines ` prefix(es) 2 
white with diagonal black lines second position marker 2 
black preverbal marker 1 
light gray with black outline word order 1 
Total 9 83 
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Figure 6.6: Sentence-final polar question marker present (3) 


Color Type Languages 
white sentence-final marker 62 
gray with white outline not present 21 
Total 2 83 
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Figure 6.7: Content question marking (4) 


Color Type Languages 
white no morphosyntactic marker or unclear 36 
gray with white outline sentence-final marker 15 
black with white outline suffix(es) 13 
white with black dots mixed 14 
light gray with black outline mobile enclitic 2 
Total 5 83 
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Figure 6.8: Alternative question marking (5) 


Color Type Languages 
white unclear 34 
gray with white outline double marking 21 
white with black dots mixed, other 19 
light gray with black outline single marking 5 
black with white outline disjunction 4 
Total 5 83 
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Figure 6.9: Presence of disjunction in alternative questions (6) 


Color Type Languages 
white unclear or not present 65 
black with white outline present 17 
Total 2 83 
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Figure 6.10: Presence of single marking in alternative questions (7) 


Color Type Languages 
white unclear or not present 72 
light gray present 11 
Total 2 83 
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Figure 6.11: Polar versus content question marking (8) 


Color Type Languages 
white different 61 
gray with white outline identical 13 
white with black dots mixed 9 


Total 3 83 
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Figure 6.12: Polar and content questions overtly marked differently (9) 


Color Type Languages 
white no or unclear 62 
gray with white outline yes 21 
Total 2 83 
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Figure 6.13: Polar versus alternative question marking (10) 


Color 


Type Languages 
gray with white outline ` identical 37 
black with white outline different 5 
white mixed or unclear 41 
Total 3 83 
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Figure 6.14: KIN-interrogatives (11): The interrogative meaning ‘who’ ina given 
language has the form KIN (velar or uvular plosive or fricative, (high) vowel 
(short or long), (apical) nasal), followed by an optional final vowel, e.g. Turkish 
kim, Forest Nenets kim'a, Aleut kiin etc. 


Color Type Languages 
white not present 53 
gray with white outline present 30 
Total 2 83 
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Figure 6.15: K-interrogatives (12): More than two interrogatives in a given lan- 
guage start with the same velar or uvular plosive or fricative, e.g. Nanai xar 
‘what’, xado ‘how many’, xooni ‘how’, Uyghur qaysi ‘which’, qačan ‘when’, 
qandaq ‘how’ etc. 


Color Type Languages 
gray with white outline not present 42 
white present 39 
light gray unclear 2 
Total 3 83 
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6.4 An atlas of the grammar of questions in Northeast Asia 


Figure 6.16: The personal interrogative ‘who’ has a different initial consonant 
from all other interrogatives (13) 


Color Type Languages 
white yes 43 
gray with white outline no 35 
light gray unclear 4 
white with black dots mixed 1 
Total 4 83 
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7 Conclusion 


According to Evans & Levinson (2009: 429), “we are the only species with a communi- 
cation system that is fundamentally variable at all levels.” The investigation of linguistic 
diversity thus should be a major concern of linguistics in general and of typology in 
particular. The main research question of this study was, following Bickel (2007: 248), 
“what’s where why?” 


Asking “what’s where?” targets universal preferences as much as geographical or 
genealogical skewings, and results in probabilistic theories stated over properly 
sampled distributions. Asking “why?” is based on the premises that (i) typological 
distributions are historically grown and (ii) that they are interrelated with other 
distributions. (Bickel 2007: 239) 


Therefore, the present study is not a classical synchronic typological investigation, but 
focused on the distribution and preliminary explanation of linguistic diversity found in 
the limited geographical area of Northeast Asia (NEA), tentatively defined as the area 
north of the Yellow River and east of the Yenisei (e.g., Chard 1974). Another question 
formulated in the Introduction was whether the concept of Northeast Asia makes sense 
from the point of view of areal linguistics. The answer is certainly yes, but with limita- 
tions. The definition of Northeast Asia as a concept strongly depends on its opposition 
with Mainland Southeast Asia (MSEA, Enfield & Comrie 2015). Regarding the number 
of languages (language diversity), NEA with with perhaps up to 150 languages ranks 
much lower than MSEA, which is the home of up to 600 different languages. In terms of 
different linguistic stocks, however, NEA has 14 instead of only 5 found in MSEA (phy- 
logenetic diversity). In comparison, the region of New Guinea is home to approximately 
1200 languages from about 35 language families on an area of only 850,000 km? (Foley 
2000). In NEA, Mongolia alone is larger than that area. There are similarly pronounced 
regional differences in linguistic diversity within Northeast Asia. The highest concen- 
tration of languages can be found in peripheral regions such as the Amdo region, the 
Ryükyüan Islands, in the Amur river shed, and around the Altai extending northwards 
along the Yenisei as well as southward along adjacent mountainous regions. Following 
Nichols (1992; 1997) these can be characterized as residual or accretion zones. Language di- 
versity is at its lowest in central parts around Mongolia, northern China, central Siberia, 
Korea, and central parts of Japan, which qualifies as a large coherent spread zone. Re- 
garding phylogenetic diversity, there is quite a different distribution that peaks around 
the eastern part of NEA along the Pacific Rim (Pacific NEA), where representatives of 
12 of the 14 language families of NEA can be found (e.g., Anderson 2010). Historically, 
however, both Yeniseic and Samoyedic, which are the only exceptions, have been spo- 
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ken further towards the southeast as well. No doubt there is a multitude of reasons for 
these strong differences in linguistic diversity, including climatic (e.g., temperature, pre- 
cipitation), geographical (e.g., landscape roughness, river density), and cultural factors 
(e.g., subsistence patterns, agriculture, pastoralism, hunting and gathering) (e.g., Nichols 
1992; Nettle 1999; Axelsen & Manrubia 2014). Not only is there a complex mixture of dif- 
ferent causes located on different time scales, but the importance of individual factors 
varies from region to region. These factors clearly also influence the size of languages, 
which is greatest in the southeast (Mandarin, Japanese, Korean) and decreases towards 
the west and especially towards the north and seems to be directly correlated with the 
distribution of population density and environmental factors such as climate and vegeta- 
tion, and, consequently, the existence of agriculture. Understandably, the exact causes of 
phylogenetic and language diversity could not be investigated within this study, which 
focused on structural diversity, more precisely the diversity found in the grammar of 
questions, i.e. those aspects of any given language that are specialized for asking ques- 
tions. The primary distinction made in the grammar of questions of a given language is 
in question marking, interrogatives, and optional additional functional domains such as 
coordination, focus, and negation. A comparison of the structural diversity of the gram- 
mar of questions found in MSEA and NEA was not feasible as there are simply too many 
languages to investigate in MSEA. The obvious next step should thus be to expand the 
typology proposed in this study to Mainland Southeast Asia (see Clark 1985; Huang 1996; 
Huang et al. 1999; Enfield 2010; Rajasingh 2014, etc.) and to other regions from around the 
globe. Nevertheless, there is evidence that NEA and MSEA, despite manifold differences 
(Chapter 3), together form one large area with a preponderance of sentence-final polar 
question markers (Dryer 20131). For reasons of space this study necessarily also excluded 
responses and answers, which is yet another avenue for further research (e.g., Enfield 
et al. 2010). Future studies should also pay more attention to intonation in questions, 
which was for the most part neglected here for mere lack of information (e.g., Sicoli et al. 
2014 and references therein). However, this study identified many important aspects of 
the grammar of questions in NEA and beyond, ranging from general principles (Chapter 
4) to specific aspects of the languages of NEA (Chapters 5, 6). Given the focus on one 
area, the typology of questions presented in this study was necessarily limited. I intend 
to elaborate on it in future studies with a global coverage. For example, the exact distri- 
bution and explanation of KIN- and K-interrogatives can only be settled with the help 
of a global sample of languages. The total discussion mentions over 450 languages and 
dialects from NEA and beyond (see the Language Index). Altogether about 900 glossed 
examples were given. The aim was to achieve both a cross-linguistically plausible typol- 
ogy and a resolution of the linguistic diversity of Northeast Asia as much as possible 
(Voegelin & Voegelin 1964: 2). 

Chapters 3 and 6 identified several important areal features such as KIN- and K- 
interrogatives that have a strong basis in Northeast Asia as well as more localized in- 
stances of diffusion and convergence such as in the so-called Amdo Sprachbund. Con- 
cerning the grammar of questions, the Tungusic languages play a less important role for 
NEA than was assumed in 83.4. However, there is no point in arguing whether Northeast 
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Asia qualifies as a clear-cut linguistic area, given the problematic status of the concept it- 
self (e.g., Campbell 2006). In terms of structural diversity, Northeast Asia admittedly has 
a relatively clear boundary towards the southeast, i.e. Mainland Southeast Asia (e.g., En- 
field & Comrie 2015), but not towards the west (e.g., Heggarty & Renfrew 2014a). While 
there are certain features such as the existence of front rounded vowels that are relatively 
widespread in NEA, these can often also be found in the adjacent regions towards the 
west, such as Europe. The reason for this seems to be in the fact that NEA over millennia 
was the starting point for a multitude of population movements and linguistic spreads 
over all of northern Eurasia towards the west (e.g., Nichols 1997: 376f.). Another major 
direction of spread was from southern NEA towards the north, often following the rivers 
Yenisei and Lena (e.g., Skribnik 2004: 151). Not only do all three large language families 
of Europe, Indo-European, Uralic, and Turkic, derive from a location further to the east 
or even from NEA, but the ancestors of all native Americans and their languages neces- 
sarily had their origin within NEA as well (e.g., Llamas et al. 2016 and references therein). 
Northeast Asia thus holds a key position for regions as far apart as western Europe and 
the Americas. One of the best examples for the importance of especially southern NEA 
for the dispersal of peoples and languages is the recent discovery of the so-called Mal'ta 
specimen found near lake Baikal that is about 24,000 years old (Raghavan, Skoglund, et 
al. 2014). It represents a population called the Ancient North Eurasians that lack a closer 
relation to modern East Asians. Instead, Ancient North Eurasians are one of four major 
founding lineages thus far identified for modern Europeans in the west (Jones et al. 2015), 
and also significantly contributed to the genome of the Kets along the middle Yenisei in 
the north (Flegontov et al. 2016) as well as of native Americans that initially spread to- 
wards Beringia in the northeast (Raghavan, Skoglund, et al. 2014). Even though the time 
scales involved are too large to be accessible through historical linguistics, such popula- 
tion movements certainly were also connected with the spread of languages. Take the 
Yamnaya culture in the Pontic-Caspian steppe, for example, which is thought to have 
brought both the ANE genome as well as the Indo-European languages into Europe (An- 
thony 2007; Anthony & Ringe 2015; Allentoft et al. 2015; Haak et al. 2015; Jones et al. 
2015). While NEA played a crucial role in the spread of populations to other parts of the 
world, it was itself reached by populations and thus most likely by languages from as 
far south as southern China (Hong Shi et al. 2013) and Southeast Asia or perhaps Aus- 
traliasia (Raghavan et al. 2015; Skoglund et al. 2015; Reich 2018: 176-181), which again 
left traces as far apart as northern and eastern Europe and South America, respectively. 
These results have potential implications for the search of long-term relations between 
languages that cannot be restricted to NEA alone. 

The title of this study promised an ecological perspective and the Introduction tenta- 
tively identified the approach as a so-called ecological typology. This approach shares its 
appreciation of human and linguistic diversity with several other approaches (e.g., Evans 
& Levinson 2009; Levinson 2012b), but in addition follows the so-called ecological com- 
mitment (Hólzl 2015d: 186) that the description of language “should be reconcilable with 
what is known from ecological research", which was formulated in analogy to the well- 
known cognitive commitment that continues to define Cognitive Linguistics (e.g., Evans 
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2012). While the cognitive approach sees “language as an integral part of cognition” (Lan- 
gacker 2008: 539), the ecological approach—and what was tentatively called ecological 
typology is only a part of it—in my interpretation sees language as an integral part of ecol- 
ogy, ie. the organism-environment system (e.g., Jarvilehto 1998; Odling-Smee & Laland 
2009). A similarity of the two approaches is the attempt to find explanations in general 
principles (Hólzl 2015b: 185), cf. the generalization commitment in Cognitive Linguistics 
(e.g., Evans 2012). In my eyes, ecology is a valuable cover term for an emerging field of 
investigations that, for the explanation of linguistic diversity and language structure, ac- 
knowledges a multitude of different reasons (e.g., Steffensen & Fill 2014; Bickel 2015; De 
Busser 2015) that take effect on different time scales or causal frames (e.g., Enfield 2014). 
This conceptual shift promises deep implications of which not even the surface could 
be scratched by this study. Linguistic diversity cannot be considered independently of a 
multitude of factors, ranging from the invention of the wheel, over the domestication of 
the reindeer or the biochemistry of the brain, up to the amount of precipitation. 

In one sense that was emphasized throughout this book, ecology “represents a shift of 
emphasis from a single language in isolation to many languages in contact.’ (Voegelin 
& Voegelin 1964: 2) Following Steffensen & Fill (2014), this was called symbolic ecology. 
The subheading An ecological perspective thus mainly refers to the aspect of language 
contact within the entire linguistic landscape of Northeast Asia. The influence of other 
ecologies such as those mentioned in the Introduction (e.g., cognitive, natural, sociocul- 
tural) are only beginning to be understood and consequently had a subordinate position 
(e.g., De Busser 2015). Nevertheless, there are indications that these influences should not 
be underestimated and deserve further research (e.g., Axelsen & Manrubia 2014; Everett 
et al. 2016). An investigation of the impact of climate, for instance, is necessarily based 
on a global sample of languages which could not be achieved within this regional study. 
However, a comparison of the results for NEA in this study and a global sample by Dryer 
(2013j) suggests a possible climatic influence on question marking and especially intona- 
tion: The lack of languages in NEA that mark polar questions with intonation alone and 
do not have further question marking strategies (but see §5.3.2) could be attributed to 
the fact that, for some reason, such languages are usually located in the tropics. In fact, 
Everett et al. (2015: 1322) recently found more convincing evidence for a possible climatic 
influence on language structure: 


The sound systems of human languages are not generally thought to be ecologically 
adaptive. We offer the most extensive evidence to date that such systems are in fact 
adaptive and can be influenced, at least in some respects, by climatic factors. Based 
on a survey of laryngology data demonstrating the deleterious effects of aridity on 
vocal cord movement, we predict that complex tone patterns should be relatively 
unlikely to evolve in arid [and cold] climates. 


In many cases such as this there may be several reasons for a certain phenomenon. Con- 
cerning the occurrence of tones in MSEA but not in NEA there are further possible 
explanations, including language contact or even the occurrence of certain genes (Dediu 
2011). Of course, a language can only mark questions with the help of tones if the lan- 
guage possesses tones in the first place (Hyman & Leben 2000: 593). 
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Additionally, §4.4 has tentatively proposed an ecological theory of questions, which 
describes them as a form of exploratory behavior (e.g., Gibson 1988) in the dialogical ar- 
ray (Gibson 1979; Hodges 2009). This exploration can be explained with specific epistemic 
curiosity (Berlyne 1954; Loewenstein 1994), which itself is evoked by so-called “collative” 
(ie. novel, changing, complex, conflicting, surprising, or uncertain, Berlyne 1978) prop- 
erties of the organism-environment system (Jarvilehto 1998; Turvey 2009). Humans seek 
comprehension and clarity, and there are several ways of achieving this, including men- 
tal problem solving, physical exploration, or asking questions. However, like other types 
of exploratory behavior, questions are a proactive process (e.g., Gibson 1988: 5f.). Ques- 
tions are not merely a request for information, but crucially involve predictions by the 
speaker and thus depend on our previous experience. 
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Appendix A: Data for geographical 
maps 


Numbers refer to maps in the atlas (§6.4). Abbreviations: 2PE = second position enclitic, 
COP = special copula, D = double marking, dif = different, disj = disjunction, FRV = front 
rounded vowel, H = high, id = identical, int =intonation, M = mid, ME = mobile enclitic, 
S = single marking, SFM = sentence final marker, SIP sentence initial particle. 


A Data for geographical maps 
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30 


Language 


Chitose 
Ainu 

Saru Ainu 
Shizunai 
Ainu 


Amur 
Nivkh 
East 
Sakhalin 
Nivkh 
Chukchi 
Alutor 
Koryak 
Itelmen 
CSY 
Aleut, 
Atkan 
CIA 
Russian 
Ukrainian 
Plautdiitsch 
Yiddish 
Sarikoli 
Japanese 
Hachijo 
Yilan 
Creole 
Yuwan 
Okinoerabu, 
Masana 
Shuri 
Ikema 
Ogami 
Irabu 
Hateruma 
Hatoma 
Miyara 


Sonai 


#V 

#V + SIP 
SFM 
SFM 
?SFM 
SFM 


SFM, suffix 
SFM + 
suffix 
SFM, suffix 
SFM 

SFM, suffix 
SFM 


SFM 


CQ 
(4) 
SFM 


SFM, COP 
SFM, COP 


ME 


ME 


suffix 

SFM 

SFM, suffix 
SFM 

-, SFM 
unclear 
(lack of IND 
-N) 

SFM 


dif 


31 
32 
33 
34 
35 


36 


37 
38 


39 
40 
41 
42 


43 


44 


45 


46 
47 
48 
49 


50 
51 
52 
53 
54 


55 
56 


57 


58 


59 
60 


Language 


Korean 
Jeju 
Dagur 
Buryat 
Khamnigan 
Mongol 
Cyrillic 
Khalkha 
Oirat 
Shira 
Yughur 
Santa 
Bonan 
Kangjia 
Huzhu 
Mongghul 
Minhe 
Mangghuer 
Standard 
Mandarin 
Urumqi 
Hui 
Mandarin 
Hezhou 
Tangwang 
Wutun 
Amdo 
Tibetan, 
Gonghe 
Baima 
Zhongu 
Even 
Evenki 
Negidal 


Kh. Evenki 
Orogen, 
Xunke 
Solon, 
Huihe 
Udihe 
Kilen 
Nanai 


PQ 
(2, 3) 
suffix 
suffix 
SFM 
SFM 
SFM 


SFM 


SFM 
SFM 


SFM, suffix 
SFM, suffix 
SFM, suffix 
SFM, suffix 


SFM, suffix 
SFM 


SFM 


SFM 
SFM 
SFM 
SFM, prefix 


prefix 
prefix 
SFM 
SFM 
SFM + 
suffix 
SFM 
SFM 


SFM 
SFM 


SFM 
SFM 


disj 


S (+ disj) 


S/D (+ disj) 
» 


mixed 


?dif 
mixed 
id 


mixed 
mixed 


id 
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A Data for geographical maps 


Language PQ CQ AQ PQ/CQ PQ vs. AQ 
(2, 3) (4) (5, 6, 7) (8, 9) (10) 
61 Ulcha SFM -, 2SFM D mixed id 
62  Uilta SFM SFM ? dif ? 
63  Sibe SFM -, 2SFM D mixed id 
64 Salar SFM suffix ? dif ? 
65 Kazakh SFM, suffix  - D (+ disj) dif id 
66 Kyrgyz SFM - ? dif ? 
67  Tatar SFM - ? dif ? 
(Chinese) 
68 Uyghur SFM, suffix  - D/S (+ disj) dif id 
69 Tuvan SFM suffix ? dif ? 
70  Tofa SFM suffix ? dif ? 
71  Khakas SFM - ? dif ? 
72 Sarig SFM,suffix  - D (+ disj) dif id 
73 SFM - ? dif ? 
Altai 
74  Chulym, SFM - ? dif ? 
Middle 
75 Dolgan SFM suffix D dif id 
76 Yakut SFM suffix D dif id 
77 Nganasan suffix suffix D id id 
78 Forest -, suffix -, suffix ? id ? 
Enets 
79 Tundra SFM, -, -, suffix D, - mixed id 
Nenets suffix 
80 Selkup, Taz preverbal - D dif id 
81 Ket 2PE - S/D, disj dif mixed 
82 SFM suffix D dif id 
Kolyma 
Yukaghir 
83 Tundra - suffix disj dif mixed 
Yukaghir 


1Question marking here excludes disjunctions and the difference between single and double marking is 
ignored. 


444 


Language KIN K~ who FRV PQ vs. Decl 
(11) (12) (13) 
1 Chitose + + - none 4 
Ainu 
2 Saru Ainu - none 4 
3  Shizunai + - ?none 4 
Ainu 
4 - - + none 4 
Amur 
Nivkh 
5 East - - - none 4 
Sakhalin 
Nivkh 
6 Chukchi - - - none 4 (int) 
7 Alutor - - none 4 
8 Koryak - - - ? 4 
9 Itelmen + - + none 4 (int) 
10 CSY + + - none 3 
11 = Aleut, + + + none 4 
Atkan 
12 CIA + + - ? ?4 (int) 
13 Russian - * - none 4 
14 Ukrainian - + + none 4 
15 Plautdiitsch  - - - ? 4 
16 Yiddish - - - none 4 
17  Sarikoli - ? - none 4 
18 Japanese - - E none 4 
19 Hachijō - - - ? ?4 
20 Yilan - - + none 4 
Creole 
21 Yuwan - - + none 4 
22  Okinoerabu, - - 4 none 4 
Masana 
23  Shuri - - + none 4 
24 Ikema - - + none 4 
25 Ogami - : * none 4 
26 Irabu - - + none 4 
27  Hateruma - - + none 4 
28 | Hatoma - - * none 4 (int) 
29  Miyara - - * none 4 (int) 
30  Sonai - - + none 4 


445 


A Data for geographical maps 
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Language KIN K~ who FRV PQ vs. Decl 
(11) (12) (13) 
31 Korean - + ?H+M 3 
32 Jeju - - ?+ ? 3 
33 Dagur + + - none 4 
34 Buryat + + - H+M 4 
35 Khamnigan + + - ?H 4 
Mongol 
36 Cyrillic + + - none 4 
Khalkha 
37  Oirat + - H+M 4 
38 Shira + - H+M 4 
Yughur 
39 Santa + + none 4 
40  Bonan + - 2M 4 
41 Kangjia + - + ?M«H 4 
42  Huzhu + + - none 4 
Mongghul 
43 Minhe + + - none 4 
Mangghuer 
44 Standard - - - H 4 
Mandarin 
45 Urumqi - - e H 4 
Hui 
Mandarin 
Ap  Hezhou - - ? H 4 
47  Tangwang a - = H 4 
48  Wutun - - - none 4 
49 Amdo - - + H 4 
Tibetan, 
Gonghe 
50 Baima - 2+ + ?H+M 4 
51 Zhongu - - + none 4 
52 Even - - + ?none 4 
53 Evenki - - + none 4 
54  Negidal - - * ? 4 
55 Kh. Evenki - - + ?none 4 
56 Orogen, z S + H 4 
Xunke 
57 Solon, - - +/- M+H 4 
Huihe 
58 Udihe - - M+H 4 
59 Kilen - M+H 4 
60 Nanai - none 4 


who4 FRV PQ vs. Decl 


Language KIN K~ 
(11) (12) (13) 
61 Ulcha - + + ? 4 
62 Uilta - + + M 4 
63 Sibe - - + H+M 4 
64  Salar + + + H+M 4 
65 Kazakh + + + H+M 4 
66 Kyrgyz + + + H+M 4 
67 Tatar + + + H+M 4 
(Chinese) 
68 Uyghur + + + H+M 4 
69 Tuvan + + - H+M 4 
70 Tofa + + P+ H+M 4 
71  Khakas 4 + H+M 4 
72 Sarig + + + H+M 4 
73 + + ?- H+M 4 
Altai 
74 Chulym + + + H+M 4 
75 Dolgan + + - H+M 4 
76 Yakut + + + H+M 4 
7! Nganasan E + + H 3 
78 Forest - + + none mixed 
Enets 
79 Tundra ?- + - none mixed 
Nenets 
80 Selkup, Taz  - + - ?H+M 4 
81 Ket - - - none 4 
82 + + + M 4 
Kolyma 
Yukaghir 
83 Tundra + + + M 4 (int) 
Yukaghir 


*The personal interrogative has a different initial consonant from all other interrogatives. 
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Zhao Yong-Bin, 43 

Zhaonasitu, 232, 241, 251 

Zhong Jinwen, 52, 280 

Zhong Suchun, 222, 246 

Zhu Yongzhong, 240, 267, 281 

Zhukova, A. N., 122 

Zikmundová, Veronika, 308, 329 

Zwicky, Arnold M., 54, 60, 141 


509 


Language index 


Abui, 81, 94, 412 
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Ainuic, 15, 16, 18-21, 25, 45, 46, 49, 50, 
103, 109, 111, 169, 244, 354, 398, 
401, 405, 418 

Alchuka, v, 16, 51, 213, 284, 286, 311, 313- 
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193, 205, 213, 244, 283, 354, 391, 
397, 398, 401, 413, 418, 419 

Jeju, 25, 200, 201, 207—209, 214-216, 395, 
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Koryak, 20, 121, 122, 124—128, 397, 400, 
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Kurux, 73, 406 
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Manchu, 7, 27, 30, 31, 38, 43, 49-52, 
60, 76, 77, 83-86, 126, 213, 
215, 223, 228, 247, 284-286, 
297, 298, 300, 30076, 305-311, 
311°°, 312-318, 320, 322, 324- 
330, 398, 402-404, 407-409, 
411, 413, 415-417 

Manchu-Tungusic, 30, see Tungusic 

Manchuic, 284, 286, 327 


Mandarin, v, 11, 13, 15, 29, 31, 32, 38, 41, 
43, 46, 47, 49, 50, 60, 68, 70, 
73, 79, 83, 147, 147!°, 148, 198, 
199, 224, 229, 234, 237, 247, 
251, 254-263, 26377, 264-266, 
270, 272-275, 278, 280, 307- 
310, 312, 335, 337, 397, 399- 
402, 404, 406-411, 413, 415, 
417, 420, 436, 443, 446 

Mangghuer, 218, 234, 237, 240—243, 245, 
250, 254, 402, 404, 413, 415, 
419, 443, 446 

Mansi, 40, 364 

Mansic, 40, 363 

Mari, 46, 364 

Mariic, 363 

Mator, 33, 363, 369-371, 375 

Mauwake, 60, 67, 68 

Mednyj Aleut, see Copper Island Aleut 
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Middle Korean, 211, 212, 215, 216, 398, 
404, 406, 415 

Middle Mongol, 27, 50, 168, 221, 225, 227, 
238, 244, 333, 397 

Miyara, 166, 185, 187-193, 195, 398, 399, 
401, 419, 442, 445 

Moghol, 26, 27, 163", 217, 218, 224, 242, 
243, 245 

Mon, 65 

Mongghul, 218, 237-243, 245, 253, 254, 
415, 419, 443, 446 

Mongolian, 15, 24, 27, 52, 93, 168, 217, 
219, 221, 222, 224-232, 243- 
245, 247, 249, 250, 284, 291- 
295, 302, 308, 334, 347, 38077, 
396, 404, 407—409, 411, 417 

Mongolic, v, 16, 18, 26, 27, 31-33, 35, 
43, 46, 48-52, 93, 106, 119, 119?, 
147°, 205, 213, 217-220, 222- 
224, 226, 227, 231, 232, 236, 
240, 242, 244-246, 251, 255, 
263, 265, 269, 281, 287, 288, 
291, 293, 303, 308, 316, 319, 322, 
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331, 334, 347, 352, 355, 360, 
398, 400—402, 404-406, 413, 
415 

Mordvin, 364 

Mordvinic, 363 

Muskogean, 64 

Muót, 76 


Na-Dene, 15-17, 34, 57, 383-385 

Nagahama, 190 

Nakachi, 166, 183 

Nanai, 6, 16, 30, 39, 79, 86, 246, 285, 295, 
299, 300, 30076, 302, 305, 312- 
318, 324-326, 328, 405, 413, 
416, 417, 420, 443, 446 

Nanaic, 30, 284, 285, 301, 302, 312, 315, 
316, 319, 323, 326, 416, 417 

Naukan, 21, 129, 132, 134-137 

Naukanski, see Naukan 

Navajo, 384, 385 

Negidal, 20, 44, 52, 136, 231, 285, 289, 
294-296, 302, 312-314, 317, 
318, 320, 370, 392, 396, 403, 
404, 420, 443, 446 

Nembe, 64 

Nenets, 33, 122, 348, 363, 364, 367-371, 
373-375, 395, 397, 400, 403, 
405, 408, 409, 420, 444, 447 

Nganasan, 16, 33, 35, 136, 148, 159, 161, 
363-367, 370, 371, 374, 375, 
395, 397, 400, 403, 413, 417, 
420, 444, 447 

Niger-Congo, 15, 64, 65, 67, 73 

Nivkh, 15, 16, 19, 20, 31, 39, 48, 52, 113- 
117, 119, 120, 125, 304, 319, 325, 
399, 402-404, 411, 413, 419, 
442, 445 

Nuristani, 23, 140 

Nymylan, see Itelmen 

Nyulnyulan, 58, 71 


Ob-Ugric, 40 
Odul, see Yukaghir 
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Ogami, 166, 176, 181, 182, 187, 188, 190, 
192, 193, 198, 228, 398, 400, 401, 
403, 419, 442, 445 

Oghur, 32, 332, 354, 405 

Oghuz, 32, 331-334, 354 

Oirat, 15, 26, 217, 218, 230, 231, 243-245, 
250, 400, 403, 419, 443, 446 

Okinawan, see Shuri 

Okinoerabu, 166, 176, 181, 183, 185, 186, 
188, 190, 193, 194, 196, 403, 419, 
442, 445 

Old Chinese, 10, 29, 46, 47, 256, 273-275, 
410 

Old Japanese, 25, 39, 109, 166, 168-171, 
179, 187, 188, 190—196, 214, 216, 
330, 333, 397, 404 

Old Korean, 213 

Old Ryükyüan, 169, 188, 190, 192, 216 

Old Tibetan, 29 

Ometo, 55 

Omok, 386 

Omotic, 12, 75, 76 

Ongan, 44 

Ordos, 218, 230, 232, 243, 245, 247, 250 

Oroch, 285, 298, 299, 312-317, 324, 325, 
413 

Orogen, v, 27, 32, 44, 51, 52, 223, 225, 229, 
247, 284, 285, 291-294, 298, 
306, 312-314, 318, 320, 322- 
324, 327, 397, 401, 403, 404, 
410, 413, 420, 443, 446 

Ostyak-Samoyed, see Selkup 


Paleo-Asiatic, 15 
Paleo-Eskimo, 17 
Paleo-Siberian, see Paleo-Asiatic 
Palula, 66, 73, 149-151 
Pama-Nyungan, 85 
Para-Ainuic, 103 
Para-Japonic, 25, 26, 165 
Para-Koreanic, 26 
Para-Mongolic, 26, 27, 31, 52 
Para-Yeniseic, 377 

Permic, 363 


516 


Persian, 16, 24, 140, 149, 163, 1634, 331, 
340, 343 

Pichis Ashéninca, 55, 87 

Plautdiitsch, 24, 141, 142, 144, 1448, 152- 
154, 156-158, 396, 397, 419, 442, 
445 

Polish, 141, 154 

Portuguese, 16 

Prakrit, 16, 23 

pre-modern Japanese, 187, 189, 190, 192- 
194 

Prinmi, 272 

Proto-Indo-European, 152, 158 

Proto-Japonic, 189, 191 

Proto-Ainuic, 46, 47, 111 

Proto-Austronesian, 191 

Proto-Dene-Yeniseian, 34 

Proto-Eskaleut, 22, 46 

Proto-Eskimo, 138 

Proto-Indo-European, 6, 22, 85, 153, 373 

Proto-Japonic, 46, 169, 171, 188, 189, 192, 
216 

Proto-Mongolic, 6, 26, 27, 219, 250, 254 

Proto-Ryükyüan, 191, 192 

Proto-Samoyedic, 33, 46, 371 

Proto-Sino-Tibetan, see  Proto-Trans- 
Himalayan 

Proto-Slavic, 159 

Proto-Tibetic, 281 

Proto-Tungusic, 30, 46, 51, 220, 221, 247, 
287, 312, 314, 316, 317, 325, 333, 
410 

Proto-Turkic, 6, 46, 354 

Proto-Uralic, 33, 46 

Proto-Yeniseic, 34, 377, 382, 383 

Proto-Yukaghiric, 6, 35, 46, 392 

Proto-Yupik-Sirenik, 138 

Pumpokol, 34, 377 


Qiang, 75, 100, 272, 283, 417 

Qiangic, 28-30, 66, 75, 255, 270—272, 281, 
283 

Quechua, 74 


rGyalrong, 271, 272 

Rouran, 16, 50 

Rukai, 68 

Russian, v, 6, 12, 15, 16, 19-22, 24, 27, 29, 
32, 34, 35, 44, 49, 128, 130, 1307, 
137, 140, 145-148, 152, 153, 158- 
162, 247, 288, 298, 302, 3037", 
30478, 305, 350, 352, 360, 369, 
375, 378, 380, 382, 387, 396, 
397, 402, 404, 413, 416, 417, 419, 
442, 445 

Ryükyüan, 16, 25, 165, 169-171, 173, 179, 
181, 184, 187, 190—197, 396, 398, 
401 


Saami, 64, 81, 88, 364 

Saamic, 363 

Sabané, 61, 62 

Saisiyat, 65, 66 

Saka, 24, 148 

Salar, 265, 331, 332, 334, 335, 352-354, 
356, 358, 400, 403, 404, 414, 
420, 444, 447 

Salish, 64 

Samagir, 312 

Samar, 300, 312 

Samoyedic, 31, 33, 35, 45, 48, 128, 137, 
348, 363, 364, 366, 370, 371, 
373, 375, 396, 405, 435 

Sanskrit, 16, 23, 151 

Santa, 15, 217, 218, 234, 236, 237, 243- 
245, 250-252, 402, 404, 413, 
415, 417, 419, 443, 446 

Sanuma, 61 

Sarig Yughur, 244, 331, 349, 353, 356, 357, 
360, 397, 401, 403, 404, 414, 
420 

Sarikoli, 24, 140, 149, 150, 152, 163, 347, 
397, 404, 419, 442, 445 

Selkup, 33, 34, 151, 355, 363, 364, 369, 
370, 375, 379, 384, 396, 400, 
413, 420, 444, 447 

Serbi-Mongolic, 27 

Sheko, 61, 62, 82 
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Shira Yughur, 217, 218, 232, 243, 245, 250, 
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A typology of questions in 
Northeast Asia and beyond 


This study investigates the distribution of linguistic and specifically structural diversity in North- 
east Asia (NEA), defined as the region north of the Yellow River and east of the Yenisei. In par- 
ticular, it analyzes what is called the grammar of questions (GQ), i.e., those aspects of any given 
language that are specialized for asking questions or regularly combine with these. The bulk of 
the study is a bottom-up description and comparison of GQs in the languages of NEA. The addi- 
tion of the phrase and beyond to the title of this study serves two purposes. First, languages such 
as Turkish and Chuvash are included, although they are spoken outside of NEA, since they have 
ties to (or even originated in) the region. Second, despite its focus on one area, the typology is 
intended to be applicable to other languages as well. Therefore, it makes extensive use of data 
from languages outside of NEA. The restriction to one category is necessary for reasons of space 
and clarity, and the process of zooming in on one region allows a higher resolution and historical 
accuracy than is usually the case in linguistic typology. The discussion mentions over 450 lan- 
guages and dialects from NEA and beyond and gives about 900 glossed examples. The aim is to 
achieve both a cross-linguistically plausible typology and a maximal resolution of the linguistic 
diversity of Northeast Asia. 
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